PREFACE

In the summer of 1932 the author was invited by Professor W. Lloyd 
Evans, Chairman of the Department of Chemistry, Ohio State Uni- 
versity, Columbus, Ohio, to give a series of lectures on quantum me- 
chanics. For the opportunity thus afforded him for study of this 
subject in a university atmosphere the author wishes to express his 
gratitude to Professor Evans. 

The notes prepared for these lectures were subsequently published 
as a serial in the Journal of Chemical Education (May 1935 to August 
1936, inclusive). To the editor, Dr. Otto Reinmuth, the author is in- 
debted for many helpful suggestions with regard to method of presen- 
tation and contents. 

Since no reprints of the series were made available it was suggested 
that the contents be revised for publication in book form. As stated 
in the first chapter, the writer's aim has been to present the subject in 
such a manner that its essential concepts and logic may be readily compre- 
hended by those who have not had any intensive training in mathe- 
matics beyond calculus. For this reason there has been presented in 
a number of cases a great deal more detail of the mathematical develop- 
ment than would seem necessary to those readers who are familiar with 
more advanced branches of mathematics. 

The author lays no claim to being an expert in the field of quantum 
mechanics. But, like many other workers in science, he has felt a strong 
desire to learn something about its technic and applications. This 
volume may therefore be regarded in a sense as a series of notes which 
have served to clarify, at least to his own satisfaction, some of the diffi- 
culties which he, together with probably a considerable number of other 
students, has encountered in attempting to understand the subject. 
Should the contents of this volume prove of any assistance to others in 
enabling them to proceed with the study of more advanced treatises, 
he will feel amply rewarded for a task which has indeed been a source of 
intellectual pleasure. 

He also wishes to take this opportunity of expressing his appreciation 
of the sympathetic support of Dr. W. D. Coolidge, the Director of the 
Research Laboratory of the General Electric Company, in a task which 
could scarcely have been carried through without it. 

To Dr. Frederick Seitz of this Laboratory the author is deeply grateful
both for helpful discussion and clarification of difficult topics and for 
reading a considerable part of the galley proof. 

Finally the author wishes to express his indebtedness to Miss Elizabeth 
Gage for invaluable assistance in the typing of the manuscript and in the 
even more tedious task of proof-reading. 

SAUL DUSHMAN. 

RESEARCH LABORATORY 
GENERAL ELECTRIC COMPANY 
Schenectady, New York 
October 7, 1937. 

ERRATA

NOTE: The author will appreciate having his attention drawn to any errata in 
the text other than those mentioned in the following list. 

Page 

38 In the second equation from the bottom insert minus sign before the
right-hand side.
49 In equation (8) replace a in exponent by ft. 

66 The equation AA + BB = 1 should read AA - BS = 1. 

t 12 

67 In the last two equations at the bottom of the page replace - by - 

ft ft 

68 In the table at the top of the page the heading of the first column should
read $2a \times 10^{8}$.

84 and 85 In the footnotes Appendix III should read Appendix IV. 
87 In equation (31) replace the factor Xi by ii. 

115 In the ninth line from the bottom the lower limit of integration should
read 0.

116 Near the top of the page the second line of the first equation should read 

-2 2a n - 2 . 

125 In the equation near the top of the page insert the factor $c^{2}$ in the
right-hand side.

142 In equation (7b) change the coefficient $1/r$ in the last term to $2/r$.

154 The last expression in the last equation should read 



x 2 x 
187 In the exponent of replace by - in the three equations in wmcn IT, 

occurs. 
225 Equation (23) should read 



228 Replace $z_1$ and $z_2$ by $q_1$ and $q_2$, respectively, in the equation for F, and
similarly in lines 7, 8 and 9.
231 In the right-hand side of equation (37b) insert the factor eft.

234 Replace dyne cm.$^{6}$ by dyne cm.$^{7}$.

235 In equation (46), insert the factor in the middle term. 
389 In equation (16) replace V^ by V$. 

CHAPTER I
QUANTUM PHENOMENA 

1.1 Introductory Remarks. A little over a third of a century has 
passed since Lord Kelvin, in an address before the British Association, 
pointed out that there were apparently two clouds upon the scientific 
horizon. One of these was represented by the experiment of Michel- 
son and Morley; the other involved the failure of classical theory in 
accounting for the observations on the energy distribution in the radia- 
tion emitted by a black body. The first difficulty led Einstein to formu- 
late his special theory of relativity, and, subsequently, a more generalized 
form of the theory which involved a radical interpretation of the force 
of gravity. The second difficulty led Planck to formulate a theory of 
energy quanta which, through the work of Einstein, A. H. Compton, 
N. Bohr, and others, has led to a corpuscular theory of the interaction of 
matter and radiation. This quantum theory entered upon a second 
phase in 1926 with the discovery of the undulatory nature of corpuscular 
motion, and, through the theoretical investigations of Heisenberg, 
Schroedinger, and Dirac, there has been developed a totally new point of 
view on the nature and behavior of electrons, atoms, and molecules. 

The system of concepts and mathematical technic originated by these 
investigators, together with the applications of these new methods to 
physical and chemical problems, constitute what has been designated 
as the new Quantum Mechanics. 

Although a more comprehensive discussion of this theory requires a 
considerable knowledge of mathematical technic with which most 
chemists and many physicists are not familiar, the writer believes that 
the essential features of the new point of view may be presented without 
recourse to such highly intricate mathematical methods. It is possible 
to obtain an understanding of the "physical" ideas by the aid of comparatively
simple mathematics. Furthermore, it is not necessary to
follow the same methods of presentation of these ideas that were used
at the beginning by the pioneers in this field. This avoids the introduc- 
tion of concepts which are both difficult to grasp and probably unes- 
sential, at least in the initial stages, for a comprehension of some of the 
basic principles and deductions. Professor E. T. Bell 1 has spoken of the
"metaphors of quantum physics," and if we regard the mathematics
of the new quantum theory as merely a symbolic language for the 
interpretation of physical phenomena, and not as a representation 
of the actual processes involved, then we shall find that a great 
deal of the apparent mystery disappears. 

After all, it is essential to realize that quantum mechanics is merely the 
most convenient type of language which has been evolved, so far, for the 
representation of a large number of observations in physical science 
which have accumulated during the past three decades. It is a language 
in which there is a one-to-one correspondence between certain symbols 
and certain observations, and the mathematical technic constitutes the 
most logical method for deriving from these observations such conclusions 
as may be subjected to further experimental tests. 

Consequently, it is essential that we should first consider carefully the 
actual observations which have led to the new point of view. The 
relationship that should exist between observations and their inter- 
pretation is one that has not always been clearly defined. It is com- 
paratively easy to confuse the shadow with the substance, and what is 
often intended by the theoretical physicist as a working analogy is 
assumed by others to be an actual physical model of the new phenomena. 

Some twenty years ago, H. Poincaré, one of the greatest mathematicians
of that period, stated his opinions on this point in a work entitled
"Science and Hypothesis." "Experiment," he wrote, "is the sole
source of truth. It alone can teach us anything new; it alone can give
us certainty. But to observe is not enough. . . . The scientist must set
in order. Science is built up with facts, as a house is with stones. But
a collection of facts is no more a science than a heap of stones is a house."

The scientist attempts to generalize from these observations, and 
thus sets up a theory so that he may be able to predict the results of 
new experiments. As has been stated in an address by I. Langmuir, 2 

Our theories consist fundamentally in the setting up of a model which has properties 
analogous to the phenomena which we have observed. For example, Bohr, taking 
into consideration certain properties of hydrogen atoms, proposed a model for these 
atoms which consisted of an electron revolving in an orbit about a nucleus. The 
energy changes that would take place in this model, according to calculation, were 
found to be identical with those that were observed for hydrogen atoms. The 
theory was thus useful and served to explain the properties of hydrogen. 

1 E. T. Bell, Sci. Monthly, 32, 193-209 (Mar., 1931). 
2 I. Langmuir, Gen. Elec. Rev., 37, 312 (1934).

Since any such model is an abstraction formed for a definite purpose, it is necessarily
incomplete and therefore the model must never be confused with the physical phenom- 
ena which it represents. We should therefore never ask whether the model repre- 
sents reality. It is sufficient to say that in certain respects it corresponds to reality. 
For example, although the equations which Bohr derived from a consideration of his 
model are still valid, we have today quite other explanations of the behavior of 
these atoms. The original Bohr model has lost its usefulness. 

Even the atomic theory of matter, which is so universally accepted today, consists 
essentially in the setting up of a model, in which chemical compounds are conceived 
of as being made up of definite arrangements of atoms, to which we assign suitable 
properties. . . . 

Most of the laws of physics are stated in mathematical terms. But a mathematical 
equation itself is nothing more than a kind of model. We establish, or assume, a 
correspondence between observable quantities and the symbols of an equation, and 
then, after a mathematical transformation, obtain a new relation or equation. If 
we can establish a similar correspondence between the symbols of the new equation 
and observational data obtained after an experiment has been performed, we have 
demonstrated the power of the mathematical theory to predict events. It thus 
becomes a useful theory. 

Above all, it is necessary to realize that analogies and models are 
always limited in their scope, and conclusions, based on their use, must 
be tested constantly by further experiment. 

In his book, "The Logic of Modern Physics," P. W. Bridgman has
emphasized one guiding principle in the formation of the concepts for
describing any new observations. "The concept," he states, "should
be synonymous with the corresponding set of operations." He illustrates
this statement by applying it to the physical concept length, and to the
philosophical concept "absolute time," and then makes the following
statement: 

It is evident that if we adopt this point of view toward concepts, namely that the 
proper definition of a concept is not in terms of its properties but in terms of actual 
operations, we need run no danger of having to revise our attitude toward nature. 
For if experience is always described in terms of experience, there must always be a 
correspondence between experience and our description of it, and we need never be 
embarrassed, as we were in attempting to find in nature the prototype of Newton's 
absolute time. Furthermore, if we remember that the operations to which a physical 
concept are equivalent are actual physical operations, the concepts can be defined 
only in the range of actual experiment, and are undefined and meaningless in regions 
as yet untouched by experiment. It follows that strictly speaking we cannot make 
statements at all about regions as yet untouched, and that when we do make such 
statements, as we inevitably shall, we are making a conventionalized extrapolation, 
of the looseness of which we must be fully conscious, and the justification of which is 
in the experiment of the future. 

Thus, in order to understand the function of our present theories on 
the structure and behavior of atomic and molecular systems and of 
electrons, it is essential to consider, first of all, the fundamental
experimental observations upon which these theories are based. Obviously
it is possible, in such a discussion as the following, to mention only those 
facts which are both the most important and most readily understood. 
Furthermore, it is not at all necessary in presenting these observations 
to adhere to a historical sequence. It is much more essential to arrange 
these facts in order of increasing deviation from what might have been 
predicted on the basis of classical physics. 

1.2 Energy States of Atomic Systems. 3 The atomic and molecular 
theories were found to be useful in the interpretation of chemical phenom- 
ena, and their utility was extended by the development of the kinetic 
theory of gases. Toward the end of the nineteenth century came the 
discovery of the electron, of X-rays, and of radioactive phenomena. 
While previous theories had led to the possibility of estimating the 
concentration and size of atoms or molecules, the new observations led 
to the conclusion that the atom itself is a complicated structure composed 
of electrons and positive charges. But it was not until about 1911 that 
a first really successful theory of atomic structure was suggested by 
Rutherford, and the subsequent investigation on X-ray spectra by 
Moseley, as well as those on isotopes by Aston, led to a new under- 
standing of the periodic arrangement of the elements. 

The model of an atom, consisting of a positively charged nucleus 
surrounded by one or more electrons, represented a significant departure 
from prevalent views in physics, since, on the basis of these views, such 
an atom must be inherently unstable. Nevertheless, it was only by 
means of this theory that the facts of radioactive disintegration and 
the observations on the scattering of alpha particles could be interpreted 
satisfactorily. The next problems to be investigated were manifestly 
those of electron configurations within the atoms themselves and of 
chemical combination between atoms. 

The theory of the origin of spectral lines first suggested by N. Bohr 
in 1913 started new lines of investigations in the applications of quantum 
theory to atomic structure problems. One of the most striking of these 
early experiments was that carried out by J. Franck and G. Hertz in 
1915. They showed that, when electrons are allowed to collide with 
atoms, there is a transfer of energy to the atoms only at certain critical 
values of the energy of the electron. If we designate the mass, charge, 
and velocity of the electron by $\mu$, $e$, and $v$, respectively, the relation
between the kinetic energy of the electron and the potential difference V
through which it is accelerated is given by

    $\tfrac{1}{2}\mu v^{2} = Ve.$    (1)

3 See references to collateral reading at the end of the chapter. 

Franck and Hertz found that this kinetic energy could be transferred
completely to an atom only at certain critical values of V (critical 
potentials), thus indicating that for each type of atom there exists a 
certain series of discrete energy states. For collisions of atoms with 
electrons having an energy less than the lowest of these critical values 
(which we shall designate by $V_r$), the laws of elastic collisions apply,
while for energies above the value $V_r$, 4 the electron loses only that
amount of energy which corresponds to the next higher critical value. 
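
As a numerical illustration of equation (1), the following sketch solves the relation for the velocity of an electron that has fallen through a given potential difference. It is not part of the original text: the function name `electron_speed` is chosen here for illustration, and modern SI values of the constants are assumed, whereas the text itself works in c.g.s. units.

```python
import math

# Modern SI values, assumed for this sketch (the text uses c.g.s. units).
ELECTRON_MASS = 9.1093837e-31        # kg
ELEMENTARY_CHARGE = 1.602176634e-19  # coulomb


def electron_speed(volts: float) -> float:
    """Non-relativistic speed in m/s obtained from (1/2) mu v^2 = V e."""
    return math.sqrt(2.0 * volts * ELEMENTARY_CHARGE / ELECTRON_MASS)


# Speed after a fall through 100 volts, converted to cm per sec.
speed_100v_cm_per_s = electron_speed(100.0) * 100.0
print(f"{speed_100v_cm_per_s:.2e} cm per sec")
```

For 100 volts the result is close to 5.9 X 10^8 cm. per sec., the figure used later in this chapter for an electron of that energy. The formula ignores relativistic corrections, which is reasonable only for potentials of this modest size.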

A second observation, made by Eldridge in America and also by 
Franck and Hertz, was that an atom in the state corresponding to $V_r$
is able to emit a monochromatic radiation of which the frequency $\nu$ is
proportional to $V_r$ in accordance with the relation

    $h\nu = V_r e,$    (2)

where $h$ is Planck's constant. It was also observed by these investigators
and others that the frequency of any line, in the spectrum of an atomic
system, could always be represented by a similar relation of the form

    $h\nu = (V_1 - V_2)e,$    (3)

where $V_1$ and $V_2$ denote the energy values in volts corresponding to two
different critical states of the atom as determined by bombardment with
electrons.

Figure 1 illustrates these observations in the case of collisions between 
electrons and sodium atoms. As long as the energy of the electron is 
less than 2.10 volts, the collisions are perfectly elastic, and, in accordance 
with the laws of ordinary mechanics (applied to the collision between 
two particles), the electron loses only an insignificantly small fraction 
of its energy, namely, $2\mu/M$, where M = mass of atom. (In the case
of sodium, $2\mu/M = 4.8 \times 10^{-5}$.) At 2.10 volts, or a slightly higher
kinetic energy of the electron, an inelastic collision occurs. 

The electron transfers a fraction of its energy, corresponding to 2.10 
volts, to the sodium atom and under suitable conditions it will be ob- 
served that the vapor emits the two D lines of sodium of wave lengths 5890 
and 5896. In other words, the 2.10 volts kinetic energy of the electron 
is used in exciting the sodium to the first excited state, which, as is shown 
in Fig. 1, is designated spectroscopically as 3P, and when the excited 
atom returns to the normal state, the radiation corresponding to the 
D lines is emitted. (Actually the 3P state consists of two states of 
slightly different energy contents; hence the emission of two lines of
nearly the same wave length.) 
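
The two numbers quoted for sodium can be checked with a short calculation. The sketch below is an illustration added here, not the author's own working: it assumes modern SI constants and an atomic weight of 22.99 for sodium, and the function names are hypothetical. It evaluates the wave length that equation (2) associates with a critical potential of 2.10 volts, and the fraction of its energy that an electron loses in an elastic collision.

```python
# Modern SI values, assumed for this sketch.
PLANCK = 6.62607015e-34              # joule sec
LIGHT_SPEED = 2.99792458e8           # m per sec
ELEMENTARY_CHARGE = 1.602176634e-19  # coulomb
ELECTRON_MASS = 9.1093837e-31        # kg
ATOMIC_MASS_UNIT = 1.66053907e-27    # kg


def wavelength_angstrom(volts: float) -> float:
    """Wave length in Angstrom units for which h * nu = V * e."""
    return PLANCK * LIGHT_SPEED / (volts * ELEMENTARY_CHARGE) * 1e10


def elastic_loss_fraction(atomic_weight: float) -> float:
    """Fraction 2 * mu / M of its energy lost by an electron in an elastic collision."""
    return 2.0 * ELECTRON_MASS / (atomic_weight * ATOMIC_MASS_UNIT)


sodium_line = wavelength_angstrom(2.10)
sodium_loss = elastic_loss_fraction(22.99)  # assumed atomic weight of sodium
print(f"wave length for 2.10 volts: {sodium_line:.0f} A")
print(f"elastic loss fraction for sodium: {sodium_loss:.2e}")
```

The wave length comes out near 5900 A, in fair agreement with the D lines; the small discrepancy arises because the critical potential is quoted to only three figures.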

4 The energy value is, of course, Ve; but it is customary to designate energy values
in terms of electron volts, that is, the value of V as defined by equation (1). See 
Appendix II for values of universal constants and conversion factors. 

As the electron energy is increased beyond 2.10 volts and maintained
below 3.18 volts, any inelastic collision between an electron and a sodium
atom results in the transfer to the latter of only 2.10 volts energy, while
the excess over this value is retained by the electron as kinetic energy.
At 3.18 volts, the electron can excite a sodium atom to the 4S state, and
from this state the only transition which can occur is that to the 3P state,
with the accompanying emission of the infra-red lines $\lambda$11,382 and
$\lambda$11,404. (Transitions can occur only between S and P states, and P
and D states, but not between states designated by similar letters.)

FIG. 1. Energy levels and lines in arc spectrum of sodium.

Thus, as the kinetic energy of the electron is increased, it becomes 
possible to excite the sodium atom to successively higher energy states 
(or "levels"), and the spectrum changes from a narrow doublet,
observed when V is below 3.18 volts, to one consisting of an increasing
number of lines, until finally, when V becomes equal to or exceeds 5.12
volts, all the lines in the arc spectrum of sodium appear. Observation 
shows that, at this latter voltage, ionization occurs, with formation of 
Na$^{+}$ and an electron.

Even before the advent of the Bohr theory, spectroscopists had 
recognized that in some of the simpler spectra at least (such as those of 
H, the alkali metals, etc.), it is possible to represent the frequency $\nu$ of
any line in the form

    $\nu = \dfrac{R}{a^{2}} - \dfrac{R}{m^{2}},$    (4)

where R is a universal constant (the Rydberg constant) and a is a 
constant for a given series of lines, while m increases by integral values 
with increase in $\nu$ for the different members of the series. In terms of
the Bohr theory, the values $R/a^{2}$ and $R/m^{2}$ correspond to "energy
levels," and in Fig. 1 a number of such energy levels are shown together
with the spectral lines from which the energy levels have been deduced.
Instead of designating the values of these levels in terms of the frequency,
spectroscopists have used the wave numbers $\tilde{\nu} = \nu/c$, where c = velocity
of light. Thus, the wave length of any line is given in terms of the
wave numbers of the corresponding levels by the relation

    $\dfrac{1}{\lambda} = \tilde{\nu}_1 - \tilde{\nu}_2 \ (\mathrm{cm}^{-1}),$    (5)

where $\tilde{\nu}_1$ is the wave number of the lower and $\tilde{\nu}_2$ that of the upper level,
between which the transition occurs. In Fig. 1, the wave numbers of 
the different levels have been indicated, and it will also be observed that 
each column corresponds to energy levels belonging to the same spectral 
series. From the difference in wave numbers $\Delta\tilde{\nu}$ for any two levels, the
corresponding electron volts may be derived by the relation 

    $Ve = hc\,\Delta\tilde{\nu}.$    (6)
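
The conversion from a difference in wave numbers to electron volts follows from equation (2) together with the definition of the wave number as frequency divided by the velocity of light. The sketch below is an added illustration under that assumption; it uses modern SI constants, and the function name `wave_number_to_volts` is hypothetical. As a check it takes the wave number of the D line at 5890 A, computed from the wave length quoted in the text.

```python
# Modern SI values, assumed for this sketch.
PLANCK = 6.62607015e-34              # joule sec
LIGHT_SPEED = 2.99792458e8           # m per sec
ELEMENTARY_CHARGE = 1.602176634e-19  # coulomb


def wave_number_to_volts(delta_wave_number_per_cm: float) -> float:
    """Electron volts equivalent to a wave-number difference given in cm^-1."""
    delta_per_metre = delta_wave_number_per_cm * 100.0
    return PLANCK * LIGHT_SPEED * delta_per_metre / ELEMENTARY_CHARGE


# Wave number of the 5890 A line: 1 / (5890 X 10^-8 cm).
d_line_wave_number = 1.0 / 5890e-8
d_line_volts = wave_number_to_volts(d_line_wave_number)
print(f"{d_line_wave_number:.0f} cm^-1 corresponds to {d_line_volts:.2f} volts")
```

The result, about 2.1 volts, agrees with the first critical potential of sodium found by electron bombardment.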

1.3 Resonance Radiation. While the first excited state of the sodium
atom may be obtained by impact of a sodium atom with an electron 
having an energy greater than 2.10 volts, the same result may also be 
obtained by allowing the D lines from a sodium-vapor lamp to strike a
heated bulb containing sodium vapor at low pressure and no electrodes
whatever. The sodium atoms absorb the radiations $\lambda$5890 and $\lambda$5896,
and thus become excited. On returning spontaneously to the normal
state, the same wave lengths are reemitted and the bulb containing the
vapor glows with a faint yellow color. The analogy with resonance
phenomena in other fields of physics has led to the designation of this
radiation as of the resonance type, and the value 2.10 volts required to
excite sodium to the state at which it will emit this resonance radiation is
therefore known as the first resonance potential.

By noting the voltages at which electrons lose kinetic energy by 
inelastic collisions with the atoms, and also by observing the wave 
lengths of the resonance radiation, it has been found possible to deter- 
mine energy-level diagrams for most of the elements, not only in their 
normal states, but also at different stages of ionization. The importance, 
however, of all these observations from the point of view of the quantum 
theory is that they lead to two conclusions, which, as a matter of fact, 
were stated by Bohr in his original paper in the form of the following two 
postulates: 

A. An atomic system can, and can only, exist permanently in a certain series of 
states corresponding to a discontinuous series of values for its energy, and conse- 
quently any change in the energy of the system, including emission and absorption of 
electromagnetic radiation, must take place by a complete transition between two 
such states. These states will be denoted as the "stationary states" of the system. 

B. The radiation absorbed or emitted during a transition between two stationary 
states is monochromatic and possesses a frequency $\nu$, given by the relation [analogous
to (3)]

    $h\nu = E_n - E_m,$    (7)

where $E_n$ and $E_m$ are the energies of the two different states.

These observations thus indicate that in the interaction of matter and 
radiation the magnitude of the energy interchange is measured in terms 
of a unit, or quantum as it has been designated, which is proportional to 
the frequency $\nu$. That this involves an atomistic view of the nature of
radiant energy is deduced from another series of investigations, viz., 
those on the photoelectric and inverse photoelectric effect and on the 
Compton effect. 

1.4 Photoelectric Effect. In the emission of electrons from metals by
the incidence of radiation, it has been observed that the energy of the 
emitted electrons is proportional to the frequency of the radiation, and 
not to the intensity. If we let W denote the work required to pass the 
electron through the surface, then, according to Einstein, the maximum 
energy of the emitted electron is given by the relation 

    $\tfrac{1}{2}\mu v^{2} = h\nu - W,$    (8a)

where $\nu$ is the frequency of the radiation used. The velocity of the
electrons is measured, in general, by the magnitude of the retarding
potential V required to decrease the velocity to zero, in accordance
with equation (1). Consequently, (8a) may be written in
the more usual form

    $Ve = h\nu - W = h(\nu - \nu_0),$    (8b)

where $\nu_0 = W/h$ is the minimum frequency which causes photoelectric
emission from the given surface and is therefore known as the " photo- 
electric threshold." 
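
Einstein's relation lends itself to a simple numerical sketch. The example below is an added illustration, not taken from the text: the work function of 2.3 volts and the wave length of 4000 A are hypothetical values chosen only to show the arithmetic, the function names are ours, and modern SI constants are assumed.

```python
# Modern SI values, assumed for this sketch.
PLANCK = 6.62607015e-34              # joule sec
LIGHT_SPEED = 2.99792458e8           # m per sec
ELEMENTARY_CHARGE = 1.602176634e-19  # coulomb


def threshold_frequency(work_function_volts: float) -> float:
    """Photoelectric threshold nu_0 = W / h, with W given in electron volts."""
    return work_function_volts * ELEMENTARY_CHARGE / PLANCK


def retarding_potential(frequency_hz: float, work_function_volts: float) -> float:
    """Retarding potential V from V e = h nu - W; zero below the threshold."""
    volts = PLANCK * frequency_hz / ELEMENTARY_CHARGE - work_function_volts
    return max(volts, 0.0)


WORK_FUNCTION = 2.3                      # volts, hypothetical surface
frequency = LIGHT_SPEED / 4000e-10       # light of wave length 4000 A
stopping = retarding_potential(frequency, WORK_FUNCTION)
threshold = threshold_frequency(WORK_FUNCTION)
print(f"retarding potential: {stopping:.2f} volts")
print(f"threshold frequency: {threshold:.3e} per sec")
```

Note that the intensity of the light does not enter the calculation at all; only the frequency and the work function determine the retarding potential.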

Equation (8b) thus leads to the conception that the electron takes
up a quantum of incident radiation, of magnitude $h\nu$, and uses it partly
in passing through the surface, and partly in the form of acquired kinetic 
energy. Such a relation is inconsistent with any theory of spreading 
wave fronts of light. A single atom on the emitting surface is apparently 
able to concentrate the radiant energy incident on an area a million or 
more times greater than the atomic cross section, into a single unit (or 
quantum) and then to utilize this energy to eject an electron.

Similar observations have been made on the ionization of atoms by 
X-rays. Apparently the X-rays are capable of passing over billions of 
molecules without losing any energy, and then, by accident as it were, 
one molecule absorbs the energy of a whole train of X-ray waves with 
the resulting ejection of an electron (which constitutes the process of 
ionization). Furthermore, in this case also we find that the relation 
between frequency of X-ray radiation and velocity of emitted electrons 
is given by Einstein's relation, equation (8a). 

The inverse photoelectric effect is another illustration of the applica- 
tion of the same relation. If a stream of electrons is directed against 
any solid, as in an X-ray tube, the maximum frequency of the radia- 
tion emitted increases linearly with the voltage in accordance with 
equation (8b).

These observations on the relation between radiant energy and kinetic 
energy of electrons are quite in disagreement with predictions based 
on the undulatory theory of light. "The effects," as Sir William Bragg
has pointed out, "are as if the energy were conveyed from place to place
in entities, such as Newton's old corpuscular theory of light provides." 
In other words, whereas the observations on interference and diffraction 
lead quite logically to a wave theory of light, the quantum phenomena 
discussed in the previous paragraphs can be interpreted only in terms 
of a corpuscular theory; that is to say that, in the interaction of radia- 
tion with electrons, the former behaves as if it were constituted of light 
units, or photons, as they have been designated. On this point of view 
we assume that these photons are guided by the electromagnetic waves 
and that what is distributed uniformly along the wave front is not the 
energy but rather the probability of occurrence of a photon. For light
waves of such intensity that the area covered by a million atoms is re- 
ceiving one quantum of energy per unit time, there is a probability of 
one in a million that any one atom will be bombarded in that interval 
by a photon, with the resultant ejection of an electron. 

1.5 The Compton Effect. This corpuscular conception of the nature 
of radiant energy was utilized by A. H. Compton in 1923 to interpret 
some very significant observations on the scattering of X-rays by solid
bodies.

When X-rays impinge on matter, secondary rays of slightly longer 
wave length, that is, of lower frequency, are produced. This result is 
distinctly different from that observed for ordinary or visible light and 
could not be understood on the basis of any classical wave theory. 
However, A. H. Compton suggested an interpretation based on the 
corpuscular theory of light which has met with signal success. 

If the incident X-rays of frequency $\nu$ be considered as a stream of light
particles or photons, then each photon carries an amount of energy
$E = h\nu$, and possesses a momentum P, which, in accordance with the
observations on light pressure, is given by the relation $P = h\nu/c$, where
c is the velocity of light. When a photon collides with a free or loosely
bound electron, there occurs an interchange of both energy and momentum
in accordance with the laws of elastic collision for particles. Consequently,
the photon suffers a recoil in one direction with loss of momentum and
decrease in energy, while the electron moves off in another direction with
added momentum and increased kinetic energy. Such a collision is illustrated
in Fig. 2, where $\theta$ is the angle between the direction of the incident
and that of the scattered photon.

FIG. 2. Illustrating the theory of the Compton effect.

Now the interesting point is that, while the distribution of values of $\theta$
is governed by a law of probability, the relation between decrease in
frequency of the photon and the value of $\theta$ for any individual collision is
that calculated on the basis of the laws of conservation of energy and
momentum.
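
The text does not write out the relation between the change in frequency and the scattering angle. The sketch below assumes the standard result of applying conservation of energy and momentum to a photon and an electron initially at rest, namely an increase in wave length of (h / mu c)(1 - cos theta). The formula, the function names, and the modern SI constants are all additions made here for illustration.

```python
import math

# Modern SI values, assumed for this sketch.
PLANCK = 6.62607015e-34        # joule sec
ELECTRON_MASS = 9.1093837e-31  # kg
LIGHT_SPEED = 2.99792458e8     # m per sec


def compton_shift(theta_radians: float) -> float:
    """Increase in wave length (metres) of a photon scattered through theta."""
    return PLANCK / (ELECTRON_MASS * LIGHT_SPEED) * (1.0 - math.cos(theta_radians))


def scattered_frequency(frequency_hz: float, theta_radians: float) -> float:
    """Frequency of the scattered photon for a given incident frequency."""
    incident_wavelength = LIGHT_SPEED / frequency_hz
    return LIGHT_SPEED / (incident_wavelength + compton_shift(theta_radians))


shift_90_degrees = compton_shift(math.pi / 2.0)
print(f"shift at 90 degrees: {shift_90_degrees * 1e10:.4f} A")
```

The shift does not depend on the incident wave length, so it is appreciable only for X-rays, whose wave lengths are themselves of the order of an Angstrom unit or less. This is consistent with the remark that the effect is not observed with visible light.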

Here, then, we have a phenomenon which, like the photoelectric 
effect, can be explained only in terms of a corpuscular theory of energy. 
Yet, in all these observations, use is made of the wave theory to determine
wave lengths and frequencies. We are thus led to adopt a dualistic
conception of the nature of radiant energy. When dealing with interference,
diffraction, and polarization phenomena, we find it necessary to
use the undulatory theory; when dealing with the interaction of radiation
and matter, it is necessary to use the corpuscular concepts, energy and
momentum. The two apparently contradictory aspects are connected
by the extremely significant relations

    $E = h\nu,$    (9)

and

    $P = \dfrac{h\nu}{c} = \dfrac{h}{\lambda}.$    (10)

1.6 Undulatory Phenomena Associated with Corpuscles. While 
de Broglie and Schroedinger had already suggested that matter also 
might partake in a similar dualistic behavior, the first experimental 
evidence for this idea was obtained by C. J. Davisson and L. H. Germer. 
In 1927 they made certain observations on the reflection of electrons 
from single crystals of nickel, which could be interpreted only on the 
assumption that under these conditions a stream of electrons possesses 
undulatory properties. The observations on the variation in intensity 
of the reflected beam with angle of incidence, for a homogeneous beam 
incident on the crystal, led to the conclusion that there exists, associated 
with the corpuscular kinetic energy of the electrons, a wave motion for 
which the wave length $\lambda$ (known as the de Broglie wave length) is related
to the momentum $\mu v$ by an equation identical with that used by
Compton, of the form

    $\lambda = \dfrac{h}{\mu v} = \dfrac{h}{\sqrt{2\mu Ve}},$    (11)



where V is the potential difference through which the electrons are 
accelerated in acquiring the velocity v. 

These observations were made with low-velocity electrons, but G. P. 
Thomson showed a little later that high-velocity electrons are diffracted 
by thin metal films in exactly the same manner as X-rays, thus repeating 
with electrons the famous experiment by which Laue had demonstrated 
in 1913 the wave nature of these rays. Subsequently it was shown by 
A. J. Dempster that, in the reflection of protons from crystal surfaces, 
the phenomena observed indicate, for this case also, a wave length 
associated with the corpuscular momentum which is given by equation (11).



Before discussing the significance of this relation, it is essential to
consider what values of $\lambda$ we may expect, on the basis of this equation, for
certain cases of corpuscular motion. Since $h = 6.56 \times 10^{-27}$ erg sec.,
it follows that for $\mu$ = 1 gm. and v = 1 cm. per sec., $\lambda = 6.56 \times 10^{-27}$ cm.
This is much too small to be measured by any sort of grating
available, and hence cannot be observed experimentally. From the
investigations of crystal lattice structures by means of X-rays, it has
been shown that the distances between atoms in such crystals are of the
order of $10^{-8}$ cm. This therefore determines, for the magnitudes of
corpuscular wave lengths that may be observed by means of crystals,
values ranging from $10^{-7}$ to $10^{-10}$ cm., while optically ruled gratings
enable us to measure wave lengths exceeding $10^{-6}$ cm. The values of
$\mu v$ corresponding to the lower range of wave lengths are $h/10^{-7}$ to
$h/10^{-10}$, that is, from $6.55 \times 10^{-20}$ to $6.55 \times 10^{-17}$ gm. cm. sec.$^{-1}$, and
such values are obtained only with atoms or electrons. For instance,
in the case of a hydrogen molecule ($\mu = 3.3 \times 10^{-24}$ gm.), the velocity
at room temperature is about $2 \times 10^{5}$ cm. per sec., and therefore
$\mu v = 6.6 \times 10^{-19}$ gm. cm. sec.$^{-1}$, while for an electron ($\mu = 9 \times 10^{-28}$ gm.)
having a velocity $5.9 \times 10^{8}$ cm. per sec. (corresponding to
a fall through a potential of 100 volts), $\mu v = 5.3 \times 10^{-19}$ gm. cm.
sec.$^{-1}$ and $\lambda = 1.24 \times 10^{-8}$ cm.
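The arithmetic of the preceding paragraph may be repeated with a few lines of Python. The sketch below is an illustration added to the text, not part of the original lectures; the function name and the dictionary of cases are hypothetical, and the numerical values are those quoted above in c.g.s. units.

```python
# de Broglie wave length, lambda = h / (mu * v), in c.g.s. units.
H = 6.56e-27  # Planck's constant in erg sec, the value quoted in the text


def de_broglie_wavelength(mass_g, velocity_cm_s):
    """Return the wave length in cm for a particle of given mass and velocity."""
    return H / (mass_g * velocity_cm_s)


# (mass in gm, velocity in cm per sec) for the three cases discussed above
cases = {
    "1 gm at 1 cm/sec": (1.0, 1.0),
    "hydrogen molecule": (3.3e-24, 2e5),
    "100-volt electron": (9e-28, 5.9e8),
}

wavelengths = {
    name: de_broglie_wavelength(m, v) for name, (m, v) in cases.items()
}

for name, lam in wavelengths.items():
    print(f"{name}: {lam:.3g} cm")
```

Only the last two cases fall within the range of roughly $10^{-7}$ to $10^{-10}$ cm. that a crystal lattice can resolve.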

It is for these reasons that phenomena exhibiting the characteristics 
associated with waves may be observed experimentally only with such 
ultramicroscopic particles as atoms and electrons and cannot possibly be 
detected, at least in the light of present knowledge, with macroscopic 
corpuscles. 

1.7 Principle of Indeterminacy. These effects which have been 
described rather briefly in the previous sections thus lead to conclusions 
which are apparently quite opposed to notions inherited from classical 
physics. Classical physics conceived light as an undulatory motion in a 
hypothetical ether; the theory of relativity discarded the ether, and it 
would appear that the quantum relations obliterate the waves. On 
the other hand, while the experiments on deflection of electrons in 
electrostatic and magnetic fields led physicists to assign to electrons a 
mass $\mu$ and a velocity v, as well as a charge e, the experiments on diffraction
lead to the conception of "electron waves," to which a definite
"wave length" may be assigned.

What explanation can be deduced for this seeming dualism in the 
behavior of both radiation and matter? The answer, first perceived 
most clearly by Heisenberg and Bohr, is that this dualism is actually 
inherent in the experimental arrangements used, in the agencies of 
observation themselves. The nature of the experiment controls the result.



The difficulty is this, that we have always assumed that we could
treat phenomena as something apart from the tools used in the observations.
After all, as Eddington reminds us, "The world of physics is a
world contemplated from within, surveyed by appliances which are
part of it and subject to its laws," whereas it had been assumed that
such observation revealed something that is independent of the mode of 
observation itself. 

When we measure a length with a meter stick, or observe the position 
of an oil drop through a telescope, we are justified in assuming that the 
act of observing has introduced no effects on the object of observation. 
Consequently, it is possible, in ordinary dynamical problems, to specify 
the instantaneous state of a particle in terms of its position (which we 
shall designate by x) and its velocity v or, more accurately, its momentum
$p = \mu v$. From a knowledge of the forces acting on the particle,
it is then possible to predict its subsequent behavior, as, for instance, its 
position and velocity after any period of time t. Such a prediction is 
valid because it is possible to make observations on the initial conditions 
without "spoiling" the results of the measurements. "However,"
as Heisenberg has pointed out, 5

This assumption is not permissible in atomic physics; the interaction between 
observer and object causes uncontrollable and large changes in the system being 
observed, because of the discontinuous changes characteristic of atomic processes. 
The immediate consequence of this circumstance is that in general every experiment 
performed to determine some numerical quantity renders the knowledge of others 
illusory, since the uncontrollable perturbation of the observed system alters the 
values of previously determined quantities. If this perturbation be followed in its 
quantitative details, it appears that in many cases it is impossible to obtain exact 
determination of the simultaneous values of two variables, but rather that there is a 
lower limit to the accuracy with which they can be known. 

For instance, in the Bohr theory of the hydrogen atom, the motion of 
an electron around the nucleus is treated on the same basis as the motion 
of the earth around the sun. It is assumed that we can measure both 
the position and velocity of the electron at any instant and that from 
this we can derive a magnitude which we designate as frequency of 
revolution in an orbit. But can position and velocity be specified 
simultaneously for an electron in an atomic system? Heisenberg's 
answer is that this is impossible. In fact, the more accurately we 
attempt to determine the position, the less accuracy we attain in the 
measurement of velocity, and vice versa. 

As an illustration, let us consider the manner in which we might try to 
determine the position at any instant of an electron in motion. In 
order to see the electron, it must be illuminated, and from optical considerations
it is known that for an ideal lens the uncertainty $\Delta x$ in the
determination of x is given by the relation

$$\Delta x = \frac{\lambda}{\sin\theta}, \tag{12}$$

where $\lambda$ is the wave length of light used and $2\theta$ is the aperture of the lens
(see Fig. 3). Thus $\theta$ should be chosen as large as possible, and $\lambda$ as small
as possible. Theoretically we could use gamma rays, the shortest
wave lengths of radiation obtainable. To make the observation it is
necessary that at least one photon should be scattered by the electron and
pass through the microscope lens to the eye of the observer. In
consequence of the Compton effect, the electron receives a recoil, and the
amount of this recoil cannot be determined, since the lens receives in the
same focus all the rays originating in the angle $2\theta$. Thus the
uncertainty in the magnitude of the loss in momentum of the electron in
the x-direction is given by

$$\Delta p_x \geq \frac{h}{\lambda}\sin\theta. \tag{13}$$

FIG. 3. Illustrating the Principle of Indeterminacy.

The inequality indicates that the magnitude of the inexactitude will
never be less than $h\sin\theta/\lambda$, but may be greater, owing to physical
imperfections in the experimental arrangement.

Consequently, we arrive at the very significant result

$$\Delta x\,\Delta p_x \geq h. \tag{14}$$

5 W. Heisenberg, "The Physical Principles of the Quantum Theory," p. 3.

To minimize the loss in momentum, we might use radiation of much 
greater wave length. In fact, we might attempt to measure the velocity 
of the electron by means of the Doppler effect, and in order to increase 
the accuracy of observation, it would be necessary to work with very 
long-wave-length radiation, but this would increase the inaccuracy in 
determination of position. 
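The reciprocal behavior of the two uncertainties may be made concrete by a short calculation. The following Python sketch is an added illustration under stated assumptions: the aperture half-angle of 30 degrees and the three wave lengths are arbitrary choices, and the function name is hypothetical. It evaluates equation (12) and the lower limit of equation (13) for an ideal lens.

```python
import math

H = 6.56e-27  # Planck's constant in erg sec, the value quoted in the text


def microscope_uncertainties(wavelength_cm, half_aperture_rad):
    """Return (dx, dp) for an ideal lens: eq. (12) and the lower limit of eq. (13)."""
    dx = wavelength_cm / math.sin(half_aperture_rad)
    dp = H * math.sin(half_aperture_rad) / wavelength_cm
    return dx, dp


theta = math.radians(30.0)  # assumed half-angle of the lens aperture

# Gamma rays, X-rays, and visible light (assumed representative wave lengths in cm)
results = []
for lam in (1e-10, 1e-8, 5e-5):
    dx, dp = microscope_uncertainties(lam, theta)
    results.append((lam, dx, dp, dx * dp))
    print(f"lambda = {lam:.0e} cm: dx = {dx:.2e} cm, dp = {dp:.2e}, dx*dp = {dx * dp:.2e}")
```

Shortening the wave length sharpens the position and coarsens the momentum in the same proportion, so that the product remains equal to h, in agreement with equation (14).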

In the foregoing discussion, use has been made, on the one hand, of 
the wave theory in connection with resolving power of the lens, and, 
on the other hand, of the corpuscular theory of the Compton effect. 
However, the conclusion stated in equation (14) may be derived also 
from a consideration of the wave properties exhibited by electrons when 
made to pass through a slit. We imagine a homogeneous beam of 
electrons incident in the normal direction on a screen containing a slit.
In order to fix the position of one of the electrons at the instant of passing,
we must choose a slit of extremely narrow width d. The coordinates 
parallel to the screen are thus determined with the accuracy 

$$\Delta x = d.$$

But if d is comparable in magnitude with the de Broglie wave length $\lambda$,
the electrons will be deflected at the edges (diffraction phenomenon).
Consequently, the emergent beam has a finite angle of divergence $\theta$,
which, according to the laws of optics, is determined by the relation

$$\sin\theta = \frac{\lambda}{d} = \frac{\lambda}{\Delta x},$$

and the momentum of the electrons in a direction parallel to the screen
is uncertain, after passing the slit, by an amount

$$\Delta p = \frac{h}{\lambda}\sin\theta.$$

From these two relations, equation (14) follows, as before. 

A similar relation is valid for the simultaneous determination of E,
the energy, and t, the instant of observation. For, in order to determine
the difference in frequency $\Delta\nu$ between two frequencies $\nu$ and $\nu + \Delta\nu$, we
must extend the observation over an interval of time $\Delta t = 1/\Delta\nu$. Hence

$$h\,\Delta\nu\,\Delta t = \Delta E\,\Delta t \geq h. \tag{15}$$

The conclusions stated in equations (14) and (15) constitute the 
generalization which is known as Heisenberg's Principle of Indeter- 
minacy , 6 and though it does not enable us to make any calculations on the 
behavior of atomic systems and electrons, it is extremely important 
in indicating the nature of the predictions which can be made about such 
particles. 

Heisenberg's principle postulates that there exists a fundamental 
limitation governing the possibility of associating exact determination of 
position with exact determination of momentum, when dealing with 
such systems as atoms and electrons, and the reason for this is the fact 
that any observation on atomic systems or electrons involves an inter- 
action with agencies of observation, not belonging to the system. Thus 
the initial conditions in any dynamical problem involving atoms are 
indeterminable to the extent defined by equation (14), and consequently
we cannot expect classical methods to be valid for calculating the 
behavior of a microscopic system such as an atom or an electron. 

6 What Heisenberg designated as "Unbestimmtheit" has been translated into the
English equivalents: inexactitude, indefiniteness, uncertainty, and indeterminacy. 




That this limitation is of negligible significance in the calculation of 
macroscopic systems is readily evident from considerations similar to 
those advanced previously in the discussion of the undulatory phenomena 
associated with corpuscular motion. The experimental limitations in 
the accurate determination of either position or velocity, in dealing with 
the motion of ordinary masses, are so large that the Heisenberg in- 
exactitude relation becomes completely obscured. This is no longer 
true, however, when dealing with the motions of electrons in atomic 
systems. In view of the impossibility of determining accurately the 
initial conditions in these cases, a precise statement of subsequent 
occurrences is no longer possible. What, then, can be calculated with 
regard to the behavior of such a system? 
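The contrast between the two scales may be seen from equation (14) written as $\Delta v \geq h/(\mu\,\Delta x)$. The Python sketch below is an added illustration; the choice of a 1-gm. body located to within $10^{-4}$ cm. and of an electron located to within $10^{-8}$ cm. is an assumption made only for the sake of the example, and the function name is hypothetical.

```python
H = 6.56e-27  # Planck's constant in erg sec, the value quoted in the text


def min_velocity_uncertainty(mass_g, dx_cm):
    """Smallest velocity uncertainty (cm per sec) allowed by dx * dp >= h."""
    return H / (mass_g * dx_cm)


# A 1-gm. body whose position is known to within 1e-4 cm (assumed values)
dv_macro = min_velocity_uncertainty(1.0, 1e-4)

# An electron (9e-28 gm.) confined to atomic dimensions, 1e-8 cm
dv_electron = min_velocity_uncertainty(9e-28, 1e-8)

print(f"1-gm. body: dv >= {dv_macro:.2e} cm per sec")
print(f"electron:   dv >= {dv_electron:.2e} cm per sec")
```

For the gram-sized body the limit lies many orders of magnitude below any experimental error, while for the electron it is of the same order as the velocity of a 100-volt electron cited in the discussion of corpuscular wave lengths.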

In the ordinary affairs of life we have learned to solve such problems 
by applying the methods of the theory of probability. Thus the life of 
any individual human being is indeterminate in duration, but life 
insurance statistics enable us to state the life expectancy for any in- 
dividual at a given age. Similarly, in the manufacture of any piece of 
mechanism, where such production involves a large number of units, it is 
possible to predict on the basis of statistical information what the prob- 
ability is for the occurrence in any unit of a given type of characteristic. 

In the kinetic theory of gases we have the well-known probability 
distribution laws of Maxwell and Boltzmann. These laws state the man- 
ner in which the probability of occurrence of a given range of velocities 
or energies varies with the velocity or energy. Thus it is found that, 
while there is a decreasing probability for the occurrence of very high 
or very low velocities, there exists, for each temperature and composition 
of gas, a certain velocity for which the probability is a maximum. 

Now let us return to the consideration of the problem in atomic 
mechanics. Here, as has been mentioned previously, we are confronted 
with the fact that initial conditions are defined only within the limits 
determined by Heisenberg's principle. In view of this uncertainty, 
it is evident that all that we may expect to determine from the solution 
of a problem on the behavior of an atomic system is the probability of 
occurrence of any individual event. That is, the new quantum 
mechanics is essentially a technic for the calculation of statistical prob- 
abilities, and not one which enables us to predict absolute certainties in 
the same sense as we have been led to expect from ordinary mechanics. 

Since, as has been emphasized previously, the indeterminacy becomes 
less and less significant with increase in the values of $\mu v$ and x beyond
those dealt with in the consideration of atomic systems, it is evident that 
for macroscopic phenomena the new quantum mechanics must yield 
results which are identical with those derived by classical mechanics.
Bohr had recognized the necessity for fulfilling this condition in developing
his hybrid theory of classical mechanics and quantizing principles.
As a result he formulated his famous Correspondence Principle, and,
in the new quantum theory as well, the spirit of this principle has been
maintained. For instance, as shown by both C. G. Darwin 7 and E. H.
Kennard, 8 the path of a macroscopic particle, falling freely under the action
of gravitational forces, when derived by the methods of Schroedinger,
is found to be identical in form with that derived by the method of
Newtonian mechanics. If we adopt the language of the quantum
theory, we may describe this result thus: there is an infinitely high
degree of probability that, at the end of a given interval of time, the
magnitudes of $\mu v$ and x will approach certain determined values more
closely than any differences that can be measured, even with the utmost
possible physical precision. On the other hand, the ordinary calculation
states that, at the end of the given interval of time, $\mu v$ and x will
have these actual definite values. In other words, for large-scale phe- 
nomena, classical mechanics merely states as a certainty a result to 
which quantum mechanics assigns such an extremely high degree of 
probability that for all practical purposes it becomes a certainty.

CHAPTER II
THE SCHROEDINGER EQUATION IN ONE DIMENSION 

2.1 The Concepts of Quantum Mechanics. It is the concept of 
probability that characterizes the new point of view, and quantum 
mechanics represents a modification of classical mechanics which 
enables us to predict the behavior of any system from the point of view 
of the Principle of Indeterminacy. As a consequence of the application 
of this new mechanics, certain conclusions have been deduced which 
are quite different from those expected on the basis of Newtonian 
mechanics. On the other hand, the new theory leads to discrete energy 
states for atomic systems by a much less artificial mode of derivation 
than was possible in the Bohr-Sommerfeld theory. Furthermore, the 
calculation now leads to correct solutions in those cases where the latter 
theory failed to give a satisfactory answer, while it yields the same results 
as the older theory wherever the latter did give results in agreement with 
observation. 

The actual mathematical technic of the new quantum mechanics 
has evolved from what appeared at first to be three different lines of 
attack. The first one, originated by W. Heisenberg, is a purely symbolic
type of mathematics and is quite unsuitable for elementary presentation.
The second one, developed by E. Schroedinger (we shall use the symbol
S. in referring to him), has enjoyed greater popularity, probably, as
Eddington has suggested, "because it is the only one that is simple
enough to be misunderstood." The third line of development is that
presented by P. Dirac in his treatise, "The Principles of Quantum
Mechanics"; one which involves, like Heisenberg's treatment, a symbolism
which is apt to repel any who are not "mathematically minded."

In the presentation which the writer has attempted in the following 
sections, only the most essential aspects of the S. technic are considered, 
without regard to the particular arguments by which S. actually de- 
veloped his equation. The reader who desires to obtain an idea of the 
actual method of derivation used by S. will find this presented in his 
original papers and some of the treatises on quantum mechanics. 1 

However, since it is impossible to present even the simplest formulation
of the S. theory without recourse to certain fundamental mathematical
ideas, it has been considered advisable to precede the derivation
of the S. equation and the consideration of some of its applications by
some remarks on these more purely mathematical aspects.

1 See the list of "General References on Quantum Mechanics," in Appendix I;
also E. Schroedinger, Ann. Physik, [4] 79, 361, 489 (1926).

2.2 Some Fundamental Differential Equations. 2 A differential 
equation is a quantitative expression of a hypothesis regarding the 
mechanism of a given phenomenon. In general, what we measure as 
" pointer-readings " are the results over a period of time, or over a 
finite distance, of certain forces acting between the bodies constituting 
the system under observation. These forces vary with time and with 
changes in relative arrangement of the parts of the system, but from the 
integration of the differential equation, whenever this is feasible, it is 
possible to deduce the total change in the magnitudes defining the state 
of the system under any given conditions or at any instant of time in 
terms of the initial conditions. 

(1) Let us consider the very simple law governing the motion of a 
body under the action of a constant force. Let F denote the force, $\mu$ the
mass, and s the distance measured along the path of motion.

By definition, the force is equal to the rate of change of momentum.
Hence

$$F = \frac{d(\mu v)}{dt},$$

where v = velocity at any instant.

For velocities which are small compared with that of light, $\mu$ is constant.
Therefore, we can write

$$F = \mu\frac{dv}{dt}.$$

But velocity is defined as rate of change of distance along the path.
That is,

$$v = \frac{ds}{dt}.$$

Consequently, we obtain the result

$$F = \mu\frac{d^2 s}{dt^2}. \tag{2}$$

Since F is constant, we obtain the differential equation of the second
order

$$\frac{d^2 s}{dt^2} - \frac{F}{\mu} = 0 \quad\text{or}\quad \frac{d^2 s}{dt^2} - a = 0, \tag{3}$$

where a is the "acceleration."
2 See list of references on mathematics, at the end of the chapter.

We now proceed to find s as a function of t. This will give us a
"solution" of the differential equation. Integrating once, we have

$$\int \frac{d^2 s}{dt^2}\,dt = \int a\,dt,$$

so that

$$\frac{ds}{dt} = at + \text{constant},$$

or

$$\frac{ds}{dt} = at + v_0,$$

where $v_0 = (ds/dt)_0$ is the velocity at t = 0. This is a differential
equation of the first order, and integration of this equation gives the result

$$s = \tfrac{1}{2}at^2 + v_0 t + s_0, \tag{4}$$

where $s_0$ is the value of s at t = 0.

Figure 4 shows plots of (1) $F/\mu = d^2 s/dt^2 = a$, the acceleration;
(2) $v = ds/dt$, the velocity; and (3) s, the distance traversed in time t,
all as functions of t.

FIG. 4. Geometrical interpretation of a simple differential equation and of
its solution.

Equation (4) is designated a general solution of equation (2), and it will
be observed that, in the process of solving the latter, two constants of
integration are introduced, one of which corresponds to the initial value of
ds/dt and the other to the initial value of s.

The differential equation (3) expresses the equation of motion of a body
under the action of gravity, where $F/\mu = g$, the gravitational
constant.
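The relation between the differential equation and its solution may also be checked numerically. The Python sketch below is an added illustration, not taken from the original text: it integrates $d^2 s/dt^2 = a$ by small steps and compares the result with the general solution $s = \tfrac{1}{2}at^2 + v_0 t + s_0$. The value g = 980 cm. per sec.$^2$, the initial conditions, and the function names are assumptions chosen for the example.

```python
def closed_form(a, v0, s0, t):
    """Velocity and distance from the general solution for constant acceleration."""
    return a * t + v0, 0.5 * a * t * t + v0 * t + s0


def step_by_step(a, v0, s0, t_end, steps=100_000):
    """Integrate d2s/dt2 = a by simple (Euler) steps of size dt."""
    dt = t_end / steps
    v, s = v0, s0
    for _ in range(steps):
        s += v * dt  # ds = v dt
        v += a * dt  # dv = a dt
    return v, s


g = 980.0  # cm per sec^2, assumed illustrative value of the acceleration

# Two integration constants: initial velocity 100 cm per sec, initial distance 0
v_exact, s_exact = closed_form(g, 100.0, 0.0, 2.0)
v_num, s_num = step_by_step(g, 100.0, 0.0, 2.0)

print(f"closed form: v = {v_exact:.2f}, s = {s_exact:.2f}")
print(f"stepwise:    v = {v_num:.2f}, s = {s_num:.2f}")
```

The stepwise result approaches the closed form as the number of steps is increased; the two initial values supplied to either function play the part of the two constants of integration.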

(2) The special case of equation (3) in which F, and consequently, a, 
is equal to zero, gives rise to the second-order differential equation of 
the form 

$$\frac{d^2 y}{dx^2} = 0, \tag{5}$$

which leads to the first-order equation

$$\frac{dy}{dx} = c, \text{ a constant},$$

and this in turn, to the result

$$y = cx + d,$$

where d is a second integration constant.

The last equation is that of a straight line, and c defines the slope,
while d defines the intercept on the y-axis, since y = d for x = 0. 

(3) A very important type of second-order differential equation is 
that representing a simple harmonic motion. If a particle of mass $\mu$
moves with respect to a fixed point in such a manner that the restoring
force acting on the particle is proportional to the displacement (r) from
the fixed point, the differential equation for the motion has the form

$$\mu\frac{d^2 r}{dt^2} = -kr, \tag{6}$$

where k is a constant which we shall regard as positive.
In order to integrate this equation, we try the solution

$$r = r_0 e^{mt}.$$

Hence

$$\frac{d^2 r}{dt^2} = r_0 m^2 e^{mt},$$

and substituting in (6) we obtain the "characteristic equation"

$$\mu m^2 + k = 0,$$

that is,

$$m^2 + \frac{k}{\mu} = 0.$$

The roots of this equation are $m = \pm i\sqrt{k/\mu} = \pm i\omega$, where $\omega = \sqrt{k/\mu}$,
and $i = \sqrt{-1}$.

Consequently, the general solution or complete integral has the form

$$r = Ce^{i\omega t} + De^{-i\omega t}, \tag{7}$$

where C and D are two arbitrary constants of integration.

The right-hand side of equation (7) may be expressed in a more
familiar form by making use of the series expansion formulas 3 for
$e^{i\omega t}$ and $e^{-i\omega t}$:

$$e^{\pm i\omega t} = 1 \pm i\omega t + \frac{(i\omega t)^2}{1\cdot 2} \pm \frac{(i\omega t)^3}{1\cdot 2\cdot 3} + \frac{(i\omega t)^4}{1\cdot 2\cdot 3\cdot 4} \pm \cdots$$

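3 These expansions are derived from Maclaurin's series, which in turn is a special
application of Taylor's theorem.

Therefore,

$$e^{i\omega t} + e^{-i\omega t} = 2\left(1 - \frac{(\omega t)^2}{1\cdot 2} + \frac{(\omega t)^4}{1\cdot 2\cdot 3\cdot 4} - \cdots\right) = 2\cos\omega t, \tag{8}$$

and

$$e^{i\omega t} - e^{-i\omega t} = 2i\left(\omega t - \frac{(\omega t)^3}{1\cdot 2\cdot 3} + \frac{(\omega t)^5}{1\cdot 2\cdot 3\cdot 4\cdot 5} - \cdots\right) = 2i\sin\omega t. \tag{9}$$

That is,

$$e^{i\omega t} = \cos\omega t + i\sin\omega t, \tag{10}$$

$$e^{-i\omega t} = \cos\omega t - i\sin\omega t. \tag{11}$$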

In consequence, equation (7) may be expressed in the form

$$r = A\cos\omega t + B\sin\omega t = A\cos\sqrt{\frac{k}{\mu}}\,t + B\sin\sqrt{\frac{k}{\mu}}\,t, \tag{12}$$

where the constants A and B are given in terms of C and D by the two
sets of relations

$$A = C + D; \qquad B = i(C - D), \tag{13}$$

$$2C = A - iB; \qquad 2D = A + iB. \tag{14}$$

That is, in the general case, C and D are complex conjugate quantities.
If, as is customary, we designate the complex conjugate of any quantity
by a bar over the symbol (or an asterisk), then it follows from equation
(14) that

$$\bar{C} = D, \quad\text{and}\quad \bar{D} = C,$$

while

$$C\bar{C} = D\bar{D} = \frac{A^2 + B^2}{4}. \tag{15}$$

The quantity on the right-hand side of the last equation is always real
and is designated the norm of the complex quantity C or D. The
positive value of the square root, that is $\sqrt{A^2 + B^2}/2$, is known as the
modulus of C or D. This is usually expressed in the form

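$$|C| = |D| = \frac{\sqrt{A^2 + B^2}}{2}. \tag{16}$$

We shall now consider the physical interpretation of equations (7)
and (12). Evidently we can write the latter in the form

$$r = \sqrt{A^2 + B^2}\left(\frac{A}{\sqrt{A^2 + B^2}}\cos\omega t + \frac{B}{\sqrt{A^2 + B^2}}\sin\omega t\right)
= \sqrt{A^2 + B^2}\,\sin(\omega t + \delta), \tag{17}$$

where $\sin\delta = A/\sqrt{A^2 + B^2}$ and $\cos\delta = B/\sqrt{A^2 + B^2}$.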
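The relations between the complex constants C and D and the real constants A and B may be verified numerically. The Python sketch below is an added illustration; the values A = 3, B = 4 and $\omega$ = 2 are arbitrary assumptions, and the function names are hypothetical. It forms C and D from $2C = A - iB$ and $2D = A + iB$, then compares the exponential and trigonometric forms of r.

```python
import cmath
import math

A, B = 3.0, 4.0  # illustrative real constants (assumed)
omega = 2.0      # illustrative angular frequency (assumed)

C = (A - 1j * B) / 2
D = (A + 1j * B) / 2


def r_exponential(t):
    """r = C exp(i omega t) + D exp(-i omega t), the exponential form."""
    return C * cmath.exp(1j * omega * t) + D * cmath.exp(-1j * omega * t)


def r_trig(t):
    """r = A cos(omega t) + B sin(omega t), the trigonometric form."""
    return A * math.cos(omega * t) + B * math.sin(omega * t)


times = [0.1 * k for k in range(50)]
max_imag = max(abs(r_exponential(t).imag) for t in times)
max_diff = max(abs(r_exponential(t).real - r_trig(t)) for t in times)

norm = (C * C.conjugate()).real  # should equal (A^2 + B^2) / 4
modulus = abs(C)                 # should equal sqrt(A^2 + B^2) / 2

print(f"norm = {norm}, modulus = {modulus}")
print(f"largest imaginary part = {max_imag:.1e}, largest difference = {max_diff:.1e}")
```

Because D is the complex conjugate of C, the imaginary parts of the two exponential terms cancel and r is real at every instant.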

As t is varied from t = 0 to t = $2\pi/\omega$, r passes through the cycle of
values indicated in the following table.

    $\omega t$              r
    0                       $\sqrt{A^2 + B^2}\,\sin\delta = A$
    $\pi/2 - \delta$        $\sqrt{A^2 + B^2}\,\sin(\pi/2) = \sqrt{A^2 + B^2}$
    $\pi/2$                 $\sqrt{A^2 + B^2}\,\cos\delta = B$
    $2\pi$                  $\sqrt{A^2 + B^2}\,\sin\delta = A$

In the period $\tau = 2\pi/\omega$, the value of r varies from 0 to $\pm\sqrt{A^2 + B^2}$,
and it is evident that, for any value of $t = n\tau$ (where n = 0, 1, 2, etc.),
the value of r will pass through such a series of oscillations n times.
That is, the particle performs harmonic vibrations about the point r = 0,
of which the amplitude is $\sqrt{A^2 + B^2}$, and the frequency

$$\nu = \frac{1}{\tau} = \frac{\omega}{2\pi} = \frac{1}{2\pi}\sqrt{\frac{k}{\mu}}.$$

The angle 8 is known as the phase angle, and the particular value of 
this angle depends upon the initial conditions. In the case of equation 
(17), it is evident that the initial condition was r = A for t = 0, and

$$\delta = \tan^{-1}\frac{A}{B}.$$

On the other hand, for the initial condition r = B, the value of the
phase angle would be

$$\delta = \tan^{-1}\frac{B}{A}.$$

Equation (17) expresses in the form of a single harmonic function the 
same motion as that expressed by the sum of two harmonic functions in 
(12). That the equations are equivalent may be demonstrated readily 
by considering the case A = B. Then the two equations become 



$$x = A(\cos\omega t + \sin\omega t), \tag{i}$$

and, since $\sin\pi/4 = \cos\pi/4 = 1/\sqrt{2}$,

$$x = \sqrt{2}\,A\sin\left(\omega t + \frac{\pi}{4}\right). \tag{ii}$$

In the first of these equations, x is expressed as the sum of two identical
harmonic waves which are 90° out of phase; in the second equation,
x is expressed as a single harmonic wave, having an amplitude $\sqrt{2}$ times
that of each of the waves in (i) and 45° out of phase with each of these.




FIG. 5. Illustrating the superposition principle of wave motions. 



The significance of these considerations is illustrated by the plots in 
Fig. 5. Curve I corresponds to $y = 3\cos\omega t$; curve II to $y = 4\sin\omega t$,
and curve III to $y = 5\sin(\omega t + \delta)$ where $\delta$ = 36.8°. It will be observed
that each ordinate for curve III is the sum of the ordinates, for
the same value of t, for curves I and II. This illustrates a generalization
known as the Principle of Superposition, which is of extremely great
importance in quantum mechanics.
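The three curves of Fig. 5 may be reproduced numerically. The Python sketch below is an added illustration, with $\omega$ = 1 as an arbitrary assumption. It computes the amplitude and phase angle from A = 3 and B = 4 and confirms that the sum of the cosine and sine curves coincides with the single sine curve at every sampled instant.

```python
import math

A, B = 3.0, 4.0  # amplitudes of curves I and II in Fig. 5
omega = 1.0      # assumed angular frequency; any positive value would serve

amplitude = math.hypot(A, B)  # sqrt(A^2 + B^2)
delta = math.atan2(A, B)      # phase angle: sin(delta) = A/amplitude, cos(delta) = B/amplitude


def curve_sum(t):
    """Sum of the ordinates of curves I and II."""
    return A * math.cos(omega * t) + B * math.sin(omega * t)


def curve_single(t):
    """Curve III: a single harmonic function with a phase angle."""
    return amplitude * math.sin(omega * t + delta)


# Sample slightly more than one full period
times = [0.01 * k for k in range(629)]
max_diff = max(abs(curve_sum(t) - curve_single(t)) for t in times)

print(f"amplitude = {amplitude}, delta = {math.degrees(delta):.2f} degrees")
print(f"largest difference between the two forms = {max_diff:.1e}")
```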

We could also incorporate the phase angle $\delta$ into the exponential
expressions used in equation (7). In that case, we put

$$r = Ce^{i(\omega t + \delta)} + De^{-i(\omega t + \delta)},$$

where $2C = (A - iB)e^{-i\delta}$

and $2D = (A + iB)e^{i\delta}$.

This leads to the result

$$A^2 + B^2 = 4C\bar{C} = 4D\bar{D}$$

and $\sqrt{A^2 + B^2} = 2|C| = 2|D|$,

as before. 

Returning to the consideration of equation (12), it follows that the 
velocity of the particle at any instant is given by 

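$$\frac{dr}{dt} = \dot{r} = -A\omega\sin\omega t + B\omega\cos\omega t,$$

and the initial momentum $p_0 = \mu\dot{r}_0$ is given by

$$p_0 = \mu B\omega.$$

Consequently, since $A = r_0$, the displacement at t = 0, we can write
equation (12) in the form

$$r = r_0\cos\omega t + \frac{p_0}{\mu\omega}\sin\omega t, \tag{19}$$

and the momentum at any instant is given by

$$p = \mu\dot{r} = -r_0\omega\mu\sin\omega t + p_0\cos\omega t. \tag{20}$$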



Again, it is often convenient to express the motion in terms of the 
total energy E and the frequency. 
From equation (6) it follows that the potential energy

$$V = -\int_0^r F\,dr = \frac{kr^2}{2},$$

since by definition the potential energy 4 is the increase in energy due to
the displacement of the particle from r = 0 to r.
Also, the kinetic energy at the point r is given by

$$T = \frac{\mu\dot{r}^2}{2} = \frac{p^2}{2\mu}.$$

4 See more comprehensive discussion of potential and kinetic energy in Chapter IV.

Hence,

$$E = T + V = \frac{p^2}{2\mu} + \frac{kr^2}{2}, \tag{21}$$

and since this must be a constant for the system, it follows that p as a
function of r will be represented by an ellipse.

Comparing equation (21) with the equation of an ellipse in terms of
the major and minor semi-axes a and b, which is

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1, \tag{22}$$

it is seen that the semi-axes of the ellipse will be given in the present
case by $\sqrt{2\mu E}$ for the axis along which p is measured, and by $\sqrt{2E/k}$ for
the axis along which r is measured.
Since $k = \mu\omega^2$, it follows from equation (21) that

$$p^2 + \mu^2\omega^2 r^2 = 2\mu E, \tag{23}$$

which expresses p and r in terms of E.

By substituting $p_0$ and $r_0$ in this equation it follows that equation
(20) may be written in the form

$$p = \sqrt{2\mu E}\,\cos(\omega t + \delta), \tag{24}$$

where $\cos\delta = \dfrac{p_0}{\sqrt{2\mu E}}$

and $\sin\delta = \dfrac{\mu\omega r_0}{\sqrt{2\mu E}}$.

Also it follows from (24) and (21) that we can write (19) in the form

$$r = \sqrt{\frac{2E}{k}}\,\sin(\omega t + \delta). \tag{25}$$

These two equations thus express p and r in terms of the total energy
and of $\omega = 2\pi\nu$.
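The constancy of the total energy, and the elliptical relation between p and r, may be checked by direct computation. The Python sketch below is an added illustration; the values of the mass, the force constant, and the initial displacement and momentum are arbitrary assumptions, and the function names are hypothetical. It evaluates r and p from the initial conditions and tests the ellipse relation and the single-cosine form of the momentum.

```python
import math

mu, k = 2.0, 8.0          # assumed mass and force constant
omega = math.sqrt(k / mu)
r0, p0 = 0.5, 3.0         # assumed initial displacement and momentum

# Total energy from the initial conditions: E = p^2/(2 mu) + k r^2 / 2
E = p0**2 / (2 * mu) + k * r0**2 / 2


def state(t):
    """Displacement and momentum at time t from the initial conditions."""
    r = r0 * math.cos(omega * t) + p0 / (mu * omega) * math.sin(omega * t)
    p = -r0 * omega * mu * math.sin(omega * t) + p0 * math.cos(omega * t)
    return r, p


# Phase angle defined by cos(delta) = p0 / sqrt(2 mu E), sin(delta) = mu omega r0 / sqrt(2 mu E)
delta = math.atan2(mu * omega * r0, p0)


def momentum_single_cosine(t):
    """Momentum written as one cosine with amplitude sqrt(2 mu E)."""
    return math.sqrt(2 * mu * E) * math.cos(omega * t + delta)


times = [0.05 * k_ for k_ in range(200)]

# Ellipse with semi-axes sqrt(2 mu E) along p and sqrt(2 E / k) along r
ellipse_values = [
    p**2 / (2 * mu * E) + r**2 / (2 * E / k) for r, p in map(state, times)
]
max_p_diff = max(abs(state(t)[1] - momentum_single_cosine(t)) for t in times)

print(f"E = {E}")
print(f"ellipse expression ranges from {min(ellipse_values):.12f} to {max(ellipse_values):.12f}")
```

Every sampled point (r, p) lies on the same ellipse, which is the geometrical statement that E remains constant during the motion.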

(4) Lastly, we shall consider the second-order differential equation 
of the form

$$\frac{d^2 y}{dx^2} - m^2 y = 0, \tag{26}$$

where m is a constant, and $m^2$ is always positive.
As in the case of equation (6) we try the solution

$$y = Ae^{kx},$$

so that 6

$$y'' = k^2 Ae^{kx}.$$

Hence, the "characteristic equation" is

$$k^2 - m^2 = 0,$$

which has the roots $k = \pm m$, and this leads to the general solution for
equation (26) of the form

$$y = Ae^{mx} + Be^{-mx}. \tag{27}$$

For x = 0, $y_0 = A + B$, and

$$y'_0 = m(A - B).$$

Thus the two integration constants can be expressed in terms of the
values of y and y' for x = 0.

Corresponding to the exponential expressions for the sine and cosine
functions given in (8) and (9), we have the following relations:

$$e^{mx} + e^{-mx} = 2\cosh mx, \tag{28}$$

$$e^{mx} - e^{-mx} = 2\sinh mx, \tag{29}$$

where cosh and sinh denote the "hyperbolic" cosine and "hyperbolic"
sine functions, respectively. Hence (27) may be written in the
form, analogous to (12),

$$y = (A + B)\cosh mx + (A - B)\sinh mx, \tag{30}$$

where (A + B) and (A - B) evidently may be replaced by two
new constants, thus retaining the general form of solution with two
arbitrary constants.

FIG. 6. Plot of hyperbolic cosine and sine functions.

Figure 6 shows the graphs for cosh x and sinh x, and it will be observed
that the two functions become more and more nearly equal to each
other for large positive values of x, while they diverge more and more
with increasingly negative values of x.

6 The notation $y' = dy/dx$, $y'' = d^2y/dx^2$; $\dot{y} = dy/dt$, $\ddot{y} = d^2y/dt^2$, is used very
frequently in mathematical treatises.
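The equivalence of the exponential and hyperbolic forms of the solution, and the behavior of cosh x and sinh x for large positive and negative x, may be checked with a short computation. The Python sketch below is an added illustration; the constants A, B and m are arbitrary assumptions and the function names are hypothetical.

```python
import math

A, B, m = 1.5, -0.5, 2.0  # assumed integration constants and parameter m


def exponential_form(x):
    """y = A exp(mx) + B exp(-mx)."""
    return A * math.exp(m * x) + B * math.exp(-m * x)


def hyperbolic_form(x):
    """y = (A + B) cosh(mx) + (A - B) sinh(mx)."""
    return (A + B) * math.cosh(m * x) + (A - B) * math.sinh(m * x)


xs = [-2.0 + 0.1 * k for k in range(41)]
forms_agree = all(
    math.isclose(exponential_form(x), hyperbolic_form(x), rel_tol=1e-12, abs_tol=1e-12)
    for x in xs
)

# cosh x - sinh x = exp(-x): small for large positive x, large for negative x
gap_positive = math.cosh(5.0) - math.sinh(5.0)
gap_negative = math.cosh(-5.0) - math.sinh(-5.0)

print(f"forms agree on the sampled range: {forms_agree}")
print(f"cosh - sinh at x = +5: {gap_positive:.3e}; at x = -5: {gap_negative:.3e}")
```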

2.3 Equations for the Propagation of Wave Motion. 6 We shall now 
consider the manner in which it is possible to express the propagation of a 
wave motion along a single coordinate axis. Such an expression should 
give the amplitude of the wave at any point x as a function of the time t.

Let us consider the expression

$$y = y_0\cos(ax - \omega t), \tag{31}$$

where a and $\omega$ are two (positive) constants.

For values of $t = ax/\omega$, the amplitude y repeats itself. Hence,
equation (31) must represent the propagation of a wave motion for
which the velocity of propagation (phase velocity) is $u = x/t = \omega/a$.

If $\nu$ denote the frequency, and $\lambda$ the wave length,

$$u = \nu\lambda,$$

and hence

$$\frac{\omega}{a} = \frac{2\pi\nu}{a} = \nu\lambda,$$

so that

$$a = \frac{2\pi}{\lambda} = 2\pi\sigma, \tag{32}$$

where $\sigma = 1/\lambda$ = wave number.

Consequently we can write (31) in the more convenient form

$$y = y_0\cos 2\pi(\sigma x - \nu t). \tag{33}$$

If x becomes more positive, and t is made more positive to such an
extent that $t = \sigma x/\nu$, the amplitude y repeats itself. Therefore, equation
(33) must represent a wave motion for which the direction of
propagation is the same as that of increase in the value of x, that is,
from left to right.

On the other hand, if we wish to indicate propagation from right to 
left, it is evident that the corresponding expression must be

$$y = y_0\cos 2\pi(\sigma x + \nu t), \tag{34}$$

since, in this case, for positive increase in t, x must be made negative in 
order to make y repeat itself. 

6 See references at end of chapter.

It is also evident that for propagation from left to right we could use
the relation

$$y = y_0\cos 2\pi(\nu t - \sigma x), \tag{35}$$

since $\cos\theta = \cos(-\theta)$, while for propagation from right to left, we could
use the relation

$$y = y_0\cos(-2\pi\nu t - 2\pi\sigma x). \tag{36}$$

If now we consider the expressions

$$y = y_0\left[\cos 2\pi(\sigma x - \nu t) + i\sin 2\pi(\sigma x - \nu t)\right] = y_0 e^{2\pi i\sigma x}e^{-2\pi i\nu t} \tag{37}$$

and

$$y = y_0 e^{-2\pi i\sigma x}e^{2\pi i\nu t}, \tag{38}$$

we observe that we could use either relation to express a propagation
from left to right, while for motion in the opposite direction we could
use either of the two following relations:

$$y = y_0 e^{2\pi i\sigma x}e^{2\pi i\nu t} \tag{39}$$

or

$$y = y_0 e^{-2\pi i\sigma x}e^{-2\pi i\nu t}. \tag{40}$$

Now in quantum mechanics it is conventional, and there are also
definite reasons for the procedure, as will be pointed out subsequently,
for choosing those expressions in which $e^{-2\pi i\nu t}$ occurs. Under these
conditions we shall regard $e^{2\pi i\sigma x}$ as indicating a wave motion from left to
right, as follows from equation (37), while we shall regard $e^{-2\pi i\sigma x}$ as
indicating propagation from right to left, as follows from equation (40).

We may also proceed now to derive from these relations the differential 
equation for the propagation of a wave motion. 

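Consider the relation

$$y = y_0 e^{2\pi i(\sigma x - \nu t)}.$$

Then, we derive the following partial differential coefficients:

$$\frac{\partial y}{\partial x} = 2\pi i\sigma y; \qquad \frac{\partial^2 y}{\partial x^2} = -4\pi^2\sigma^2 y.$$

The symbol $\partial$ is used to designate differentiation of a function of more
than one independent variable, with respect to one of these variables,
maintaining the other variable constant. In the present case $y = y(x, t)$,
and hence arises the necessity for using the notation for partial
differentiation.

Similarly, we derive the relations

$$\frac{\partial y}{\partial t} = -2\pi i\nu y; \qquad \frac{\partial^2 y}{\partial t^2} = -4\pi^2\nu^2 y.$$

Hence,

$$\frac{\partial^2 y}{\partial x^2} = \frac{\sigma^2}{\nu^2}\frac{\partial^2 y}{\partial t^2} = \frac{1}{u^2}\frac{\partial^2 y}{\partial t^2},$$

where $u = \nu\lambda = \omega/a$ = phase velocity.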

This equation is the desired partial differential equation which is 
evidently satisfied by each of the particular solutions given in equations 
(33), (34), (37), (38), (39), and (40). 
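
That a travelling wave of the exponential form satisfies the wave equation may be verified numerically. The sketch below uses central differences; the sample values of $\sigma$, $\nu$, the point $(x_0, t_0)$ and the step are assumptions chosen only for illustration, and the helper names are not from the text.

```python
import cmath
import math

def y(x, t, sigma, nu, y0=1.0):
    """Travelling wave y0*exp(2*pi*i*(sigma*x - nu*t)), the form of eq. (37)."""
    return y0 * cmath.exp(2j * math.pi * (sigma * x - nu * t))

def second_difference(f, s, h):
    """Central-difference estimate of the second derivative of f at s."""
    return (f(s + h) - 2 * f(s) + f(s - h)) / (h * h)

sigma, nu = 1.5, 3.0            # hypothetical wave number and frequency
u = nu / sigma                  # phase velocity, u = nu*lambda = nu/sigma
x0, t0, h = 0.3, 0.7, 1e-4      # hypothetical sample point and step

d2y_dx2 = second_difference(lambda x: y(x, t0, sigma, nu), x0, h)
d2y_dt2 = second_difference(lambda t: y(x0, t, sigma, nu), t0, h)
residual = abs(d2y_dx2 - d2y_dt2 / u**2)
print(abs(d2y_dx2), residual)   # the residual is small compared with the derivative itself
```

The residual does not vanish exactly because the derivatives are only approximated by finite differences.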

We shall proceed, in the following section, to deduce the same differ- 
ential equations from more fundamental considerations, and shall then 
indicate the method by which a general solution may be deduced. 

2.4 Differential Equation for the Vibration of a String. 7 It is a 
familiar fact that a string stretched between two fixed points, when 
made to vibrate, exhibits nodes and loops. The nature of the wave 
pattern is determined by the length of the string L, in accordance with 
the relation

$$L = \frac{n\lambda}{2},$$

where n is an integral value and $\lambda/2$, which is one-half the wave length,
is the distance between two consecutive nodes.

Similarly, a stretched membrane, organ pipe, or any other vibrating 
system exhibits definite wave patterns when vibrating, and in acoustics 
use is made of this fact to produce notes of definite frequencies or wave 
lengths. This observation that the vibrations of such systems are 
characterized, not by a continuously varying range of frequencies, but 
by a series of discrete values of these frequencies, is analogous to the 
spectroscopic observation that an atomic system may exist only in a
series of states corresponding to a discontinuous series of values for its
energy. It is this analogy, which is of a purely mathematical nature,
that Schroedinger utilized in developing his "wave equation" for
interpreting the behavior of electrons and atomic systems, and, therefore,
before proceeding with the consideration of this wave equation, it
is necessary to understand some aspects of the mathematical treatment
of the problems of vibrating systems.

7 See K. K. Darrow, Bell System Tech. J., 6, 653 (1927), for a discussion of the
differential equations for vibrating systems in two and three dimensions.

The simplest of these problems is that of a stretched string, as for
instance, the wire of a piano, or the string of a violin. It is a problem
involving only one coordinate in space, which is the distance along the
string, and also time as a second variable, since the amplitude is a
function of the time. Let us consider a stretched string, infinitely long,
extended along the x-axis. Let T denote the tension along the string (that
is, the force which maintains the string in a stretched condition), and
$\rho$, the mass per unit length. To derive the differential equation for
the motion of the string, we consider the forces acting on an element of
length $\Delta x$ which is so short that it may be regarded as practically
straight. When the string is drawn sideways (in the direction of the
y-axis), these forces have components along the two axes of coordinates.
Let $\theta$ denote the angle between the element and the original position of
the string (along the x-axis), as shown in Fig. 7.

FIG. 7. Illustrating derivation of partial differential equation for
vibration of a string.

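The mass of the element $\Delta x$ is $\rho\,\Delta x$, and its acceleration in the direction
of the y-axis is $\partial^2 y/\partial t^2$. Hence, the force acting on the element is

$$F = \rho\,\Delta x\,\frac{\partial^2 y}{\partial t^2}. \tag{41}$$

The force F is balanced by the force arising from the difference between
the tensions at the two ends of the element $\Delta x$. That is,

$$F = T[\sin(\theta + \Delta\theta) - \sin\theta]
    = T[\tan(\theta + \Delta\theta) - \tan\theta],$$

since, for small values of $\theta$, $\sin\theta$ and $\tan\theta$ are approximately equal.

But

$$\tan\theta = \frac{\partial y}{\partial x}$$

and

$$\tan(\theta + \Delta\theta) - \tan\theta = \Delta\tan\theta = \frac{\partial^2 y}{\partial x^2}\,\Delta x.$$

Hence,

$$F = T\,\frac{\partial^2 y}{\partial x^2}\,\Delta x,$$

and, comparing this equation with (41), it follows that

$$T\,\frac{\partial^2 y}{\partial x^2} = \rho\,\frac{\partial^2 y}{\partial t^2},$$

that is,

$$\frac{\partial^2 y}{\partial x^2} = \frac{\rho}{T}\,\frac{\partial^2 y}{\partial t^2}. \tag{42a}$$

This relation is usually written in the form

$$y'' = \frac{\rho}{T}\,\ddot{y}, \tag{42b}$$

where primes denote differentiation with respect to x, and dots,
differentiation with respect to t.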

Equation (42a), or (42b), is the partial differential equation which
represents a wave motion along a stretched string. It is of interest to
consider the significance of the coefficient $\rho/T$. The constant $\rho$ has the
dimensions of mass divided by length, that is $m/l$, while T has the
dimensions of a force, that is $ml/t^2$. Thus, $\rho/T$ has the dimensions $t^2/l^2$, which
are identical with those of $1/u^2$ where u denotes a velocity. Therefore,
we may replace $\rho/T$ by a constant $1/u^2$ which has the additional
advantage of indicating that this coefficient corresponds to a positive
magnitude. (This is the customary method in mathematical equations
of indicating such a condition and will be used quite frequently in
the following sections.)
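
As a small numerical illustration of this replacement, the velocity follows from the tension and the mass per unit length as the square root of their ratio. The values below are hypothetical and expressed in SI units, which is an assumption made here for convenience; the function name is likewise only illustrative.

```python
import math

def phase_velocity(tension, mass_per_length):
    """u = sqrt(T/rho), which follows from rho/T = 1/u**2."""
    if tension <= 0 or mass_per_length <= 0:
        raise ValueError("tension and mass per unit length must be positive")
    return math.sqrt(tension / mass_per_length)

tension = 100.0     # newtons (hypothetical)
rho = 0.01          # kilograms per metre (hypothetical)
u = phase_velocity(tension, rho)
print(u)            # metres per second
```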

To solve the partial differential equation of the form

$$\frac{\partial^2 y}{\partial x^2} = \frac{1}{u^2}\frac{\partial^2 y}{\partial t^2}, \tag{43}$$

the classical method of procedure is that known as solution by separation
of the variables, which is applicable provided u is a constant.
Let us assume

$$y = f(x)g(t),$$

where f(x) is a function of x only, and g(t) a function of t only.
Hence,

$$\frac{\partial^2 y}{\partial x^2} = g(t)\,\frac{\partial^2 f(x)}{\partial x^2}$$

and

$$\frac{\partial^2 y}{\partial t^2} = f(x)\,\frac{\partial^2 g(t)}{\partial t^2}.$$

Since f(x) and g(t) are each functions of only the corresponding variable,
the partial differential coefficients may be replaced by ordinary
differential coefficients.

Substituting in equation (43) and dividing by f(x)g(t), we obtain
the relation

$$\frac{1}{f(x)}\frac{d^2 f(x)}{dx^2} = \frac{1}{u^2 g(t)}\frac{d^2 g(t)}{dt^2}.$$

Since the left-hand side is independent of t, while the right-hand side
does not involve x, we may equate each term to an arbitrary constant,
which we shall designate by $-m^2$ (to indicate that it corresponds to a
negative magnitude).

We thus obtain two ordinary differential equations,

$$\frac{d^2 f(x)}{dx^2} + m^2 f(x) = 0, \tag{44}$$

and

$$\frac{d^2 g(t)}{dt^2} + m^2u^2 g(t) = 0. \tag{45}$$

These equations are similar to equation (6), and therefore the general
solutions are of the form given in equation (7) or (12). Expressed in
the exponential form, these solutions are as follows:

$$f(x) = A e^{imx} + B e^{-imx} \tag{46}$$

$$g(t) = C e^{imut} + D e^{-imut}, \tag{47}$$

where A, B, C, and D are four arbitrary constants, the values of which
are determined by the "boundary" and "initial" conditions.

In the case of the string fastened at both ends, y = 0 for both x = 0
and x = L, where L = length of string.
Therefore, for x = 0,

$$f(x) = 0 = A + B.$$

Hence,

$$A = -B.$$

For x = L,

$$f(x) = 0 = A(e^{imL} - e^{-imL}) = 2Ai \sin mL.$$

Since A is not equal to zero, $\sin mL = 0$, which means that

$$mL = n\pi; \qquad m = \frac{n\pi}{L}, \tag{48}$$

where n = 1, 2, 3, etc.

Thus, the particular solution corresponding to equation (46) is

$$f(x) = 2Ai \sin\frac{n\pi x}{L}.$$

For x = L/2n, f(x) = 2Ai. Since f(x) must be real in the case of a
vibrating string, A must be a pure imaginary, and therefore $Ai = |A| = A_0$,
a real quantity. Hence, the last equation should be written

$$f(x) = 2A_0 \sin\frac{n\pi x}{L}. \tag{49}$$

Turning now to the solution in equation (47), we have, as initial
condition, that is for t = 0,

$$g(t) = 0 = C + D; \qquad C = -D.$$

As for f(x), we thus derive the solution

$$g(t) = 2Ci \sin mut,$$

and inserting the value of m derived in (48), this becomes (since Ci
must be real, and writing $C_0$ for it)

$$g(t) = 2C_0 \sin\frac{n\pi ut}{L}.$$

Now if $\lambda$ denotes the wave length, and $\nu$ the frequency, it is evident
that f(x) must become zero for x = 0, $x = \lambda/2$, $x = \lambda$, and so forth.
Therefore, in equation (49) we can write

$$L = \frac{n\lambda_n}{2}, \qquad \text{that is} \qquad \lambda_n = \frac{2L}{n},$$

which states that L must be an integral multiple of one-half the wave
length of the note emitted by the vibrating string.
Consequently,

$$f(x) = 2A_0 \sin\frac{2\pi x}{\lambda_n} = 0 \qquad \text{for } x = L = \frac{n\lambda_n}{2}.$$

Also it follows, since $u = \nu_n\lambda_n$, that

$$g(t) = 2C_0 \sin 2\pi\nu_n t.$$

In these last two equations, $\nu_n$ designates the frequency of the harmonic
or nth normal mode of vibration and $\lambda_n$ designates the corresponding
wave length.

Thus, the particular solution of equation (43) is of the form

$$y = 4A_0C_0 \sin\frac{2\pi x}{\lambda_n} \sin 2\pi\nu_n t, \tag{50}$$

where

$$\lambda_n = \frac{2L}{n} = \frac{\lambda}{n}, \qquad \nu_n = \frac{u}{\lambda_n} = \frac{nu}{2L} = n\nu,$$

and $\lambda$ and $\nu$ are the fundamental wave length and frequency, respectively.

It will be observed that the differential equations (44) and (45) have
physically significant solutions only for those discrete values of m which
are defined by equation (48). These are known as characteristic values
or eigenvalues, and the corresponding values of the functions f(x) and
g(t), as defined in equation (50), are known as characteristic functions
or eigenfunctions.

Corresponding to each frequency $\nu_n$, there will be a definite vibration
of an amplitude defined by the magnitude of the coefficient $A_0C_0$, and
there will exist n loops (regions of maximum amplitude) and (n - 1)
nodes (regions of zero amplitude) between the ends of the wire.

It is evident that equations (46) and (47) represent a general solution
of equation (43), of which equations (33), (34), etc., in section (2.3)
are particular forms.
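
The discrete character of the solutions for a string fixed at both ends can be illustrated by tabulating the first few normal modes. The sketch below assumes the relations $\lambda_n = 2L/n$ and $\nu_n = u/\lambda_n$ obtained above; the length and phase velocity are hypothetical values, and the function names are introduced here only for illustration.

```python
import math

def mode(n, length, u):
    """Wave length and frequency of the nth normal mode: lambda_n = 2L/n, nu_n = u/lambda_n."""
    lam_n = 2.0 * length / n
    return lam_n, u / lam_n

def displacement(x, t, n, length, u, a0=1.0, c0=1.0):
    """Product of the space and time factors, 2*a0*sin(2*pi*x/lambda_n) * 2*c0*sin(2*pi*nu_n*t)."""
    lam_n, nu_n = mode(n, length, u)
    return 4 * a0 * c0 * math.sin(2 * math.pi * x / lam_n) * math.sin(2 * math.pi * nu_n * t)

def interior_nodes(n, length):
    """Points strictly between the ends at which sin(n*pi*x/L) vanishes."""
    return [k * length / n for k in range(1, n)]

L, u = 0.65, 260.0      # hypothetical string length (m) and phase velocity (m/s)
for n in (1, 2, 3):
    lam_n, nu_n = mode(n, L, u)
    print(n, lam_n, nu_n, interior_nodes(n, L))
```

Each mode has n - 1 interior nodes, and the displacement vanishes at both ends at every instant, as the boundary conditions require.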

2.5 Schroedinger Equation for One Coordinate. The actual stimulus 
to the wave mechanics, as it is designated, developed by Schroedinger 
was derived from certain theoretical speculations of Louis de Broglie re- 
garding the analogy between the laws of geometrical optics and those 
of classical dynamics. 

As is well known, the laws of geometrical optics are more and more 
valid, the shorter the wave length of light. For light waves comparable 
in length with those of the object upon which they impinge, that is, for 
rays having a radius of curvature comparable with the wave length, we 
must interpret the observations from the point of view of the undulatory 
theory. This suggested to de Broglie the possibility that Newtonian 
dynamics is also an approximation which is valid for macroscopic 
systems, but not for atomic systems because the radius of curvature of 
the electronic orbit is of the same magnitude as the wave length asso- 
ciated with the corpuscular motion. 

Let us consider, for instance, the Bohr orbit of the electron in the
normal state of the hydrogen atom. The quantum condition, introduced
by Bohr for determining the discrete energy states, is given by the
relation

$$\mu v r = \frac{h}{2\pi},$$

where $\mu$ denotes the mass and v the velocity of the electron in the
circular orbit of radius r. If a wave length $\lambda$ is associated with this
motion, the circumference of the orbit must be an integral multiple of
$\lambda$, and for the simplest case this multiple will be unity.
Hence,

$$2\pi r = \lambda.$$

Comparing these two equations, it follows that

$$\lambda = \frac{h}{\mu v},$$

which is the famous de Broglie relation.

As has been mentioned in Chapter I, this suggested association of wave 
motion with corpuscles in motion has been confirmed by the investiga- 
tions on diffraction of electrons and protons. We must now consider 
the manner in which this observed dualism in the behavior of corpuscles 
has been utilized by Schroedinger in deducing his famous equation. 

Let E denote the total energy of a particle, T the kinetic energy of 
the particle, and U the potential energy at any point in space. Then, 
in accordance with the law of conservation of energy 

E = T+ U 
and 

T = E - U. 

For a single particle moving in a field of force, such as an electron in 
the hydrogen atom, 

$$T = \tfrac{1}{2}\mu v^2 = E - U,$$

and therefore

$$\mu v = \sqrt{2\mu(E - U)},$$

while, since $\lambda = h/(\mu v)$,

$$\lambda = \frac{h}{\sqrt{2\mu(E - U)}}. \tag{51}$$
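
To give an idea of the magnitudes involved, equation (51) may be evaluated for an electron. The sketch below is illustrative only: it assumes SI units and present-day values of the constants, and the choice of a 100 electron-volt electron in a field-free region (U = 0) is hypothetical.

```python
import math

H = 6.62607015e-34          # Planck constant, joule seconds
M_E = 9.1093837015e-31      # electron mass, kilograms
EV = 1.602176634e-19        # one electron volt in joules

def de_broglie_wavelength(total_energy, potential_energy, mass=M_E):
    """lambda = h / sqrt(2*mu*(E - U)), eq. (51); real only when E > U."""
    kinetic = total_energy - potential_energy
    if kinetic <= 0:
        raise ValueError("eq. (51) gives a real wave length only for E > U")
    return H / math.sqrt(2.0 * mass * kinetic)

lam = de_broglie_wavelength(100.0 * EV, 0.0)   # hypothetical 100 eV electron, U = 0
print(lam)                                     # metres; of the order of 1e-10
```

Raising U toward E lengthens the wave length, which is the sense in which $\lambda$ varies from point to point in a field of force.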

Since in general U is a function of the coordinates of all the electrons
with respect to the nucleus, $\lambda$ must vary from point to point in the field
of force. Thus we may regard the motion of the electrons as governed
by that of the associated de Broglie waves. Each electron in an atomic
system follows the direction along which these waves are "refracted" in
the field of force which is due to the simultaneous presence of all the
particles.

The Schroedinger (S.) equation is a partial differential equation, of
the same nature as the differential equation for the vibration of a string,
which indicates the manner in which the wave pattern, produced by the
motion of an electron in a given field of force, varies with the space
coordinates. Therefore, we introduce a function $\psi$, analogous to the
amplitude in a physically vibrating system, which defines the "amplitude"
of the wave motion and is known as the S. function or space
function. Its actual physical interpretation will be considered in a
subsequent section. The simplest type of problem is that in which we
are dealing with the motion of a single particle of mass $\mu$ in a field of
force for which the potential energy U is a function of only one
coordinate x.

Hence, $\psi$ is a function of x and t only, and we denote this by writing
$\psi = \psi(x, t)$. Referring now to equation (43), which is valid for any
vibrating system in which the motion occurs along only one axis of
coordinate, we write the equation in the form

$$\frac{\partial^2\psi}{\partial x^2} = \frac{1}{\nu^2\lambda^2}\frac{\partial^2\psi}{\partial t^2}, \tag{52}$$

where $\nu\lambda$ takes the place of u, the velocity.
As in the solution of equation (43), we assume that

$$\psi(x, t) = \phi(x)\,e^{-2\pi i\nu t}, \tag{53}$$

where the function corresponding to g(t) in equation (47) is assumed to
be of the exponential form with a negative sign in the exponent. Evidently
this satisfies the partial differential equation (52). As a matter of
fact, since we are interested for the present in the stationary wave patterns,
we may postpone for a subsequent chapter any further consideration of
the exact function by which to represent the variation in $\psi(x, t)$ with t.

Incorporating the relation for $\psi$, assumed in equation (53), into the
differential equation (52), we derive the relations

$$\frac{\partial^2\psi}{\partial x^2} = e^{-2\pi i\nu t}\,\frac{d^2\phi}{dx^2}; \qquad
  \frac{\partial^2\psi}{\partial t^2} = -4\pi^2\nu^2\phi\,e^{-2\pi i\nu t}.$$

In these equations $\phi$ is used instead of $\phi(x)$.

Substituting the last two relations into (52), the result is the ordinary
differential equation

$$\frac{d^2\phi}{dx^2} + \frac{4\pi^2}{\lambda^2}\phi = 0.$$

In consequence of equation (51) this equation assumes the form

$$\frac{d^2\phi}{dx^2} + \frac{8\pi^2\mu}{h^2}(E - U)\phi = 0. \tag{54}$$

This is the form of the S. equation for one coordinate, and it will be
observed that, since U is a function of x, the equation cannot be solved
as readily as the differential equations considered in the previous sections.
The mathematical technic of solution thus depends upon the nature of
the function U = U(x).

2.6 Motion of Electrons in Absence of Field of Force. In absence of 
a field, that is, for U = 0, the S. equation assumes the form

$$\frac{d^2\phi}{dx^2} + \frac{8\pi^2\mu E}{h^2}\phi = 0. \tag{55}$$

Since the coefficient of $\phi$ is a constant, this equation has the same form
as equation (44). Thus, the solution is

$$\phi = \phi(x) = A e^{iax} + B e^{-iax}, \tag{56}$$

where $a^2 = 8\pi^2\mu E/h^2$, and A and B are arbitrary constants.

As emphasized in a previous section, this equation has physical
significance only if $a = 2\pi/\lambda$, that is, if

$$\lambda = \frac{h}{\sqrt{2\mu E}} = \frac{h}{\mu v}, \tag{57}$$

which is de Broglie's relation.

What is the physical interpretation of the function $\phi$? In equation
(37) it was shown that the expression

$$e^{iax}\,e^{-2\pi i\nu t}$$

represents a simple harmonic wave for which the direction of propagation
is from left to right, while the expression

$$e^{-iax}\,e^{-2\pi i\nu t}$$

represents a simple harmonic wave motion from right to left.
Thus, if in equation (56), we wish to designate a wave motion propagated
from left to right, we can indicate this by putting B = 0. To
obtain an interpretation of $\phi$ we proceed as follows:

Multiplying $\phi$ by its complex conjugate $\bar{\phi}$, the result is

$$\phi\bar{\phi} = A e^{iax} \cdot \bar{A} e^{-iax} = A\bar{A} = |A|^2. \tag{58}$$

That is, $\phi\bar{\phi}$ is a real magnitude. In quantum mechanics $\phi\bar{\phi}\,dx$ is regarded
as the probability of occurrence of an electron in the element of
distance dx at the point x. Since $\phi\bar{\phi}$ is a constant in the present instance,
we may interpret this magnitude as designating the number of electrons
per unit length. The fact that this linear density is constant means
that the electrons are uniformly distributed at all points along the
x-coordinate from $-\infty$ to $+\infty$. This result is in agreement with the deduction
from the Principle of Indeterminacy. For an electron having
a definite momentum $p = \mu v = \sqrt{2\mu E}$, the associated wave motion is
represented by a monochromatic (or unifrequentic) wave extending
from $-\infty$ to $+\infty$, and the position of the electron is completely undetermined.
There is an equal probability for the occurrence of the
electron anywhere along the infinite wave train.

The case $A = \pm B$ evidently corresponds to a stationary de Broglie
wave. For

$$\psi = A(e^{iax} \pm e^{-iax})\,e^{-2\pi i\nu t}$$

$$= 2A \cos ax \; e^{-2\pi i\nu t}$$

or

$$= 2Ai \sin ax \; e^{-2\pi i\nu t},$$

depending upon the particular physical conditions to be satisfied. In
this case

$$\phi\bar{\phi} = 4A^2 \cos^2 ax \qquad \text{or} \qquad \phi\bar{\phi} = 4A^2 \sin^2 ax, \tag{59}$$

and therefore the value of the distribution functions along the x-axis
varies periodically between the values 0 and $4A^2$. Again, we interpret
the expression $\phi\bar{\phi}\,dx$ as representing the relative probability of occurrence
of electrons in the element dx at the point x.

From these considerations it is possible to perceive the reasons for
choosing the exponential functions $e^{iax}$ and $e^{-iax}$ for representing the
motion of a unidirectional stream of electrons which have a definite
velocity. It is only by means of these functions that it is possible to
derive a value of $\phi\bar{\phi}$ which is constant for all points along the direction of
motion, and which is in accordance with the observation that in such
an electron stream the instantaneous location of any one electron is
completely indeterminable.

If we attempt to use the sine or cosine functions, the value of $\phi\bar{\phi}$ thus
derived turns out to be a function of x, as shown in (59), which would
indicate the possibility of observing that the electron density varies at
different points along the direction of motion in accordance with the
variation of $\cos^2 ax$ or $\sin^2 ax$. Hence we are justified in using the latter
representations only where observation indicates the existence of a
corresponding density distribution along the axis of x.
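
The contrast between the two representations can be made concrete by evaluating the product of the function and its complex conjugate at a few points. In the sketch below the amplitude A, the constant a, and the sample points are hypothetical values chosen for illustration, and the function name is not from the text.

```python
import cmath
import math

def density(phi_value):
    """phi multiplied by its complex conjugate; real and non-negative."""
    return (phi_value * phi_value.conjugate()).real

a = 2.0                         # hypothetical value of a = 2*pi/lambda
A = 0.5 + 0.5j                  # hypothetical complex amplitude, |A|**2 = 0.5
xs = [0.1 * k for k in range(8)]

# Travelling wave: B = 0 in eq. (56).
travelling = [density(A * cmath.exp(1j * a * x)) for x in xs]
# Stationary pattern: A = B in eq. (56).
standing = [density(A * (cmath.exp(1j * a * x) + cmath.exp(-1j * a * x))) for x in xs]

print(travelling)   # every entry close to |A|**2
print(standing)     # follows 4*|A|**2*cos(a*x)**2
```

The first list is constant along x, whereas the second rises and falls with the square of the cosine.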

We shall now consider the interpretation of $\nu$ and u for a de Broglie
wave motion. Since it is not possible to measure $\nu$ for such a wave by
any experimental method (the only magnitude which may be determined
experimentally is the wave length), we assume that the frequency
is defined by the quantum relation $\nu = E/h$, where E is the total energy.
According to de Broglie, the value of E that should be used in calculating
$\nu$ is that derived on the basis of the special theory of relativity, that is,
$E + \mu c^2$, where c = velocity of light. However, in any observations,
it is only energy differences that are actually measured, and, therefore,
it is immaterial which value of the total energy is used. Owing to this
indefiniteness in the absolute value of E, it follows that the value of
$u = \nu\lambda$ also cannot be determined by any experimental method.

In treatises on quantum mechanics, u is designated as phase velocity
and a distinction is drawn between this quantity, which cannot be
observed experimentally, and the so-called group velocity, which is
identical with the experimentally observed corpuscular velocity v.
However, to the writer it seems that altogether too much emphasis
has been laid on this topic, at least in an introduction to quantum
mechanics. As a matter of fact, it is meaningless to speak of a velocity u,
and a frequency $\nu$, as if they were observable magnitudes. For this
would imply a physical phenomenon in which something is, as it were,
in a state of vibration. But actually we find that in many problems $\phi$ is
a function of more than three coordinates. Thus, in the problem of the
helium atom, $\phi$ for the system is a function of the coordinates of each
electron and, therefore, represents a vibration in a space of six
dimensions. Obviously, "phase velocity" can have no physical significance
in such a case, although it is possible to calculate a value of
$\phi\bar{\phi}\,dx\,dy\,dz$, which gives the relative probability of locating one of the
electrons in the element of volume dx dy dz at any point in the space
surrounding the nucleus.

As N. F. Mott has stated, 8 in referring to the function $\phi$, "The wave
function is, so to speak, just a convenient shorthand. The waves are
not waves in any medium. There are no waves accompanying the
electron, until we have observed the electron. Then we make use of
the wave representation to embody the results of an observation. The
wave equation tells us what may be deduced, from our observations,
about the future position and velocity of an electron."

8 N. F. Mott, "An Outline of Wave Mechanics," p. 51.

2.7 Operator Method of Deriving the Schroedinger Equation. This 
divorcing of the S. equation from any physical interpretation is demon- 
strated by considering an alternative method of deriving the equation 
which involves no mention whatever of a de Broglie wave length. 

Let us refer once more to the equation for the total energy of a system,
as derived from classical mechanics. We write this, for a single particle,
in the form

$$E = \frac{p^2}{2\mu} + U(x)$$

or

$$\frac{p^2}{2\mu} + U(x) - E = 0, \tag{60}$$

where $p = \mu v$ = momentum.

This equation gives the relation for the energy in the so-called Hamiltonian
form, that is, as a function of coordinates and corresponding
momenta. This is indicated symbolically by writing E = H(p, q),
where q is used instead of x to designate the coordinate. 9

We now replace p by a differential operator, which we assume to be
of the form

$$p = \frac{h}{2\pi i}\frac{\partial}{\partial q}, \tag{61}$$

where the partial differential coefficient refers to the more general case
in which E is a function of several coordinates, and of their corresponding
momenta. In the present case, the partial differential is synonymous
with the ordinary differential, so that the symbol $\partial$ may be replaced by
d. Thus, we obtain the operator

$$-\frac{h^2}{8\pi^2\mu}\frac{d^2}{dx^2} + U(x) - E.$$

Applying this operator to the function $\phi$, the result is

$$-\frac{h^2}{8\pi^2\mu}\frac{d^2\phi}{dx^2} + [U(x) - E]\phi = 0,$$

that is,

$$\frac{d^2\phi}{dx^2} + \frac{8\pi^2\mu}{h^2}(E - U)\phi = 0, \tag{62}$$

which is the S. equation.

9 In Chapter IV this method of expressing the total energy is discussed more fully.

This method of deriving the S. equation involves a step which is very 
fundamental in the new quantum mechanics. Instead of speaking of 
observable magnitudes which characterize a given state of a system, 
we consider rather the kind of operations which may be performed upon 
the function, which is characteristic of that state. To each observable 
in the classical mechanics formulation there corresponds a certain opera- 
tor, and the results of observations on the system are predicted on the 
basis of deductions derived by operating on the characteristic function 
of a given state. 

This use of the so-called algebra of operators is characteristic of the
new quantum mechanics, but it is worth while pointing out that,
even in the more elementary types of mathematics, the use of operators
is quite general. Thus, when we write $y^3$ we indicate the successive
operations $y \cdot y \cdot y$, and the notation $\sqrt{y}$ indicates the operation of
taking the square root. Other operators are log and sin (cos, cosh,
sinh, etc.). The notation $\sin\theta$ indicates a certain operation to be
performed on the angle $\theta$. Similarly, $d/d\theta = D_\theta$ corresponds to another
kind of operation. Thus, the expression $D_\theta \sin\theta$ indicates the following
sequence of operations: (1) finding the sine of $\theta$, and (2) determining
the rate of change in $\sin\theta$ with change in $\theta$. It will be observed from
this example that the order in which the operators are given is of
fundamental importance.

Thus, although operators may be treated as algebraic symbols, that is,
we may multiply and also (usually) divide by them, as if they were
actual magnitudes, they are not in general commutative. That is, in
general, if $\alpha$ and $\beta$ are two different operators

$$\alpha\beta \neq \beta\alpha,$$

and $\alpha\beta - \beta\alpha$ is known as the commutator of the operators $\alpha$ and $\beta$. As
an illustration let us consider the commutator

$$Dx - xD$$

of the operators D and x. The result of operating on unity is

$$(Dx - xD)1 = \frac{d}{dx}(x \cdot 1) - x\frac{d}{dx}(1) = 1 - 0 = 1.$$

Again, if we operate on x, the result is

$$(Dx - xD)x = \frac{d}{dx}(x^2) - x\frac{d}{dx}(x) = 2x - x = x,$$

and similarly

$$(Dx - xD)x^n = (n + 1)x^n - nx^n = x^n.$$

Thus, the commutator of the operators D and x is unity.
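
The same result can be checked mechanically for any polynomial. In the sketch below a polynomial is represented by its list of coefficients in ascending powers of x; this representation and the helper names are assumptions made purely for the illustration.

```python
def D(poly):
    """Differentiate a polynomial given as coefficients [c0, c1, c2, ...]."""
    return [k * c for k, c in enumerate(poly)][1:] or [0]

def X(poly):
    """Multiply a polynomial by x (shift every coefficient up one power)."""
    return [0] + list(poly)

def subtract(p, q):
    """Coefficient-wise difference p - q with trailing zero coefficients removed."""
    size = max(len(p), len(q))
    p = list(p) + [0] * (size - len(p))
    q = list(q) + [0] * (size - len(q))
    diff = [pc - qc for pc, qc in zip(p, q)]
    while len(diff) > 1 and diff[-1] == 0:
        diff.pop()
    return diff

def commutator(poly):
    """(Dx - xD) applied to poly: multiply by x then differentiate, minus the reverse order."""
    return subtract(D(X(poly)), X(D(poly)))

print(commutator([1]))            # operating on unity
print(commutator([0, 0, 0, 1]))   # operating on x**3
print(commutator([2, -1, 5]))     # operating on 2 - x + 5*x**2
```

In each case the commutator returns the polynomial it was given, which is what is meant by saying that Dx - xD is unity.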

In the case of the operator p as defined in equation (61), it follows that

$$pq - qp = \frac{h}{2\pi i}\left(\frac{\partial}{\partial q}\,q - q\,\frac{\partial}{\partial q}\right) = \frac{h}{2\pi i}. \tag{63}$$

That is, p and q are non-commutative, and the commutator of these
operators, in the order p, q, is $h/(2\pi i)$.

This result represents the analog, in the new mechanics, of the
Bohr-Sommerfeld quantizing condition for the determination of electronic
orbits, which was stated in the form

$$\oint p_k\,dq_k = nh,$$

where n is an integral value, $p_k$ and $q_k$ are "canonically conjugated"
variables (see Chapter IV), and the circle on the integral sign indicates
that the integration is to be carried out over the whole orbit (periodic
path).

From equation (63) it is possible to deduce the Principle of
Indeterminacy, and it is also clear, since $p_k$ and $q_k$ do not commute, that
these variables cannot have the same relative significance in the new
theory as they had in classical mechanics.

It is of interest to point out further in what manner use may be made 
of the operator concept in deducing a physical result. 

In section 2.6 we considered the interpretation of the solutions

$$\phi_1 = A e^{iax} \qquad \text{and} \qquad \phi_2 = B e^{-iax}$$

for the motion of a homogeneous beam of electrons in absence of a field.
Applying the operator p to each side of the first of these relations, we
obtain the result

$$p\phi_1 = \frac{h}{2\pi i}\frac{d\phi_1}{dx} = \frac{ha}{2\pi}\,\phi_1 = \sqrt{2\mu E}\;\phi_1, \tag{64}$$

since $a = (2\pi/h)\sqrt{2\mu E}$.

That is, the result of operating on the function $\phi_1$ with the operator p
leads to a relation in which the coefficient of $\phi_1$ is a constant. This
means, physically, that if an experiment is performed on the beam of
electrons to determine their momentum the observation will lead to a
definite value given by the relation

$$p = +\sqrt{2\mu E}. \tag{65}$$

The positive sign before the radical indicates that p and therefore
v = dx/dt is positive. That is, the electrons are propagated from left to
right.

Similarly it may be shown that in case of $\phi_2$,

$$p = -\sqrt{2\mu E},$$

that is, the wave is propagated from right to left.
The case

$$\phi = e^{iax} + e^{-iax} = 2 \cos ax$$

or

$$\phi = e^{iax} - e^{-iax} = 2i \sin ax$$

leads to the relation

$$p\phi = -\frac{ha}{\pi i} \sin ax$$

in the first case, that is,

$$p(\cos ax) = -\frac{ha}{2\pi i} \sin ax, \tag{66}$$

and in the second case, to the relation

$$p(\sin ax) = \frac{ha}{2\pi i} \cos ax. \tag{67}$$

It will be noted that the last two equations differ from (64), where
the result of operating on the function with p led to a product of a
constant and the function operated upon. In the case
of equation (64) we were led to the conclusion, indicated by equation
(65), that the momentum has a precise value. On the other hand,
equations (66) and (67) indicate that we may no longer draw a conclusion
of this nature when dealing with the function for a stationary
wave pattern. In fact, any attempt to determine the direction of motion
of electrons in the latter case will not lead to repeatable results.
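
The distinction between the two kinds of function can be seen numerically by forming the ratio of $p\phi$ to $\phi$ at several points. The sketch below takes the operator in the form $(h/2\pi i)\,d/dx$ and approximates the derivative by a central difference; setting h = 1, the value of a, the sample points and the function names are all assumptions made for the illustration.

```python
import cmath
import math

H = 1.0    # Planck constant in arbitrary units (assumption for the illustration)

def apply_p(phi, x, step=1e-6):
    """(h / 2*pi*i) * d(phi)/dx, with the derivative taken by central difference."""
    derivative = (phi(x + step) - phi(x - step)) / (2 * step)
    return H / (2j * math.pi) * derivative

a = 3.0    # hypothetical value of a

def phi_1(x):
    """Travelling wave exp(i*a*x), propagated from left to right."""
    return cmath.exp(1j * a * x)

def phi_c(x):
    """Stationary pattern cos(a*x)."""
    return complex(math.cos(a * x))

for x in (0.2, 0.5, 0.9):
    ratio_1 = apply_p(phi_1, x) / phi_1(x)
    ratio_c = apply_p(phi_c, x) / phi_c(x)
    print(x, ratio_1, ratio_c)
# ratio_1 stays near h*a/(2*pi) at every x; ratio_c changes from point to point.
```

Only for the travelling wave is the ratio a constant, which is the numerical counterpart of the statement that the momentum then has a precise value.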
CHAPTER III
PROBLEMS OF POTENTIAL BARRIERS

The logic of the S. method and its applications may be understood
most readily by a consideration of a very important class of problems,
those of the transmission of electrons through potential barriers. The
recognition that it is possible, on the basis of the new point of view, for
electrons possessing a given kinetic energy E to penetrate into a region
for which the potential energy U exceeds E (a conclusion derived
on the basis of the S. equation) has enabled us to interpret such
observations as those on radioactive disintegration and the emission of
electrons from cold cathodes in the presence of high field strengths.

3.1 Reflection of Electrons at Semi-Infinite Potential Barriers. The 
simplest problem, which at the same time illustrates the difference 
between the conclusions which would be drawn from arguments based 
on classical mechanics and those derived on the basis of the S. equation, 
is that of a beam of electrons incident on a boundary, at which the 
value of the potential energy increases abruptly from 0 to $U_0$.

Let us consider a homogeneous beam of electrons of kinetic energy E
moving along the x-axis from left to right, and incident at x = 0 on a
boundary at which there is a retarding potential $U = U_0$ for all values
of $x \ge 0$. Two distinct cases arise here: one, illustrated in Fig. 8, for
which $E > U_0$; the other, illustrated in Fig. 9, for which $E < U_0$.

FIG. 8. Semi-infinite potential barrier; Case I. $E > U_0$.

FIG. 9. Semi-infinite potential barrier; Case II. $E < U_0$.

In Case I ($E > U_0$), according to classical mechanics, a particle
passing from left to right would merely be slowed up in crossing the
boundary at x = 0. On the other hand, in Case II ($E < U_0$) there
would be complete reflection. While quantum mechanics also shows
that there is a difference in the behavior of the particles in these two
cases, certain conclusions are deduced which are different from those
predicted by classical mechanics.

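Case I. $E > U_0$.

Let us designate the region to the left of x = 0 (U = 0) by I, and that
to the right of x = 0 ($U = U_0$) by II. Let $\phi_1$ and $\phi_2$ denote the wave
functions for each of these regions. Each function will satisfy an S.
equation, which will be of the form

$$\frac{d^2\phi_1}{dx^2} + \frac{8\pi^2\mu E}{h^2}\phi_1 = 0 \tag{1}$$

for region I, and

$$\frac{d^2\phi_2}{dx^2} + \frac{8\pi^2\mu}{h^2}(E - U_0)\phi_2 = 0 \tag{2}$$

for region II.
The solutions of these equations are, respectively,

$$\phi_1 = A e^{i\alpha x} + B e^{-i\alpha x} \tag{3}$$

$$\phi_2 = C e^{i\beta x} + D e^{-i\beta x}, \tag{4}$$

where

$$\alpha^2 = \frac{8\pi^2\mu E}{h^2}; \qquad \beta^2 = \frac{8\pi^2\mu}{h^2}(E - U_0).$$

Evidently, $\alpha = 2\pi/\lambda_1$, and $\beta = 2\pi/\lambda_2$, where $\lambda_1$ and $\lambda_2$ are the
de Broglie wave lengths in each region.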

Now we introduce the boundary conditions which have to be satisfied
by $\phi_1$ and $\phi_2$, in order that the solutions shall be physically significant.
The first condition is that at x = 0,

$$\phi_1 = \phi_2. \tag{5}$$

That is, there can be no discontinuity in the value of the wave function
at the boundary.

The second condition is that there shall be no discontinuity in the
slope, that is, at x = 0,

$$\frac{d\phi_1}{dx} = \frac{d\phi_2}{dx}. \tag{6}$$

These two conditions are necessary in order to insure the continuity
in the nature of the wave function as it crosses the boundary at x = 0.
These are the conditions which are used in classical theory in dealing
with the behavior of waves at a boundary. Furthermore, we know
from the physics of the problem that the incident beam of electrons may
give rise to two streams, one of which represents the electrons that are
transmitted from region I into region II, and the other represents the
electrons reflected at x = 0. From the considerations advanced in
the previous chapter, it follows that the incident beam must be represented
by $A e^{i\alpha x}$, the reflected beam by $B e^{-i\alpha x}$, and the transmitted beam
by $C e^{i\beta x}$. Since there are no electrons moving from right to left in
region II, D = 0.

Introducing these conclusions into (3) and (4), along with the two
boundary conditions, equations (5) and (6), we obtain the relations:

$$A + B = C,$$

$$\alpha(A - B) = \beta C.$$

Hence,

$$C = \frac{2\alpha}{\alpha + \beta}\,A,$$

and therefore,

$$B = \frac{\alpha - \beta}{\alpha + \beta}\,A. \tag{7}$$

Thus, if the concentration, that is the number of electrons per unit
distance, is equal, in the case of the incident beam, to $A^2$, then it follows
that

(a) the concentration in the reflected beam $= A^2\left(\dfrac{\alpha - \beta}{\alpha + \beta}\right)^2$,

(b) the concentration in the transmitted beam $= A^2\,\dfrac{4\alpha^2}{(\alpha + \beta)^2}$.

To obtain the coefficient of transmission T and that of reflection R,
we must compare the "currents" in the three beams. The rate at
which the electrons strike the potential barrier is equal to the sum of the
rates of transmission and reflection.
Now current = (concentration) × (velocity).
For region I, the velocity $v_1$ is given by

$$v_1 = \sqrt{\frac{2E}{\mu}} = \frac{h\alpha}{2\pi\mu},$$

while for region II,

$$v_2 = \sqrt{\frac{2(E - U_0)}{\mu}} = \frac{h\beta}{2\pi\mu}.$$

Therefore,

$$R = \left(\frac{\alpha - \beta}{\alpha + \beta}\right)^2, \tag{9}$$

$$T = \frac{v_2}{v_1}\cdot\frac{4\alpha^2}{(\alpha + \beta)^2} = \frac{4\alpha\beta}{(\alpha + \beta)^2}, \tag{10}$$

and evidently R + T = 1, which satisfies the physical requisites.
Replacing $\alpha$ and $\beta$ by the corresponding values in terms of E and $U_0$
it is readily shown that

$$R = \left(\frac{1 - \sqrt{1 - U_0/E}}{1 + \sqrt{1 - U_0/E}}\right)^2. \tag{11}$$

FIG. 10. Coefficients of reflection R and transmission T
for electrons as a function of $E/U_0$.

Table I gives values of R for different values of $U_0/E$ as derived by
means of equation (11), and Fig. 10 shows plots of T and R as functions
of $E/U_0$.

TABLE I

  U_0/E     R         E/U_0
  0.1       0.0007    10.0
  0.2       0.0034     5.0
  0.4       0.0161     2.5
  0.5       0.0296     2.0
  0.6       0.0506     1.67
  0.7       0.0852     1.43
  0.8       0.1459     1.25
  0.9       0.270      1.11
  1         1.000      1

Classical mechanics predicts the result R = 0 for all values of $E \geq U_0$;
quantum mechanics states that even for $E = 2.5\,U_0$, 1.61 per cent of
the incident electrons are reflected.
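
The entries of Table I can be checked with a few lines of Python. This is only a sketch: the function names are ours, and the formulas are equations (11) and (10) rewritten in terms of the ratio $\beta/\alpha = \sqrt{1 - U_0/E}$.

```python
import math

def reflection_coefficient(u_over_e: float) -> float:
    """R from equation (11) for a step of height U_0, valid for 0 <= U_0/E <= 1."""
    root = math.sqrt(1.0 - u_over_e)   # this is beta/alpha
    return ((1.0 - root) / (1.0 + root)) ** 2

def transmission_coefficient(u_over_e: float) -> float:
    """T from equation (10), written with beta/alpha = sqrt(1 - U_0/E)."""
    root = math.sqrt(1.0 - u_over_e)
    return 4.0 * root / (1.0 + root) ** 2

if __name__ == "__main__":
    for ratio in (0.1, 0.4, 0.8, 0.9, 1.0):
        r = reflection_coefficient(ratio)
        t = transmission_coefficient(ratio)
        print(f"U0/E = {ratio:3.1f}  R = {r:.4f}  T = {t:.4f}  R+T = {r + t:.4f}")
```

For $U_0/E = 0.4$ the function returns a value that rounds to the tabulated 0.0161, and R + T equals unity for every ratio, as required.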

Case II. $E < U_0$.
As in the preceding case we consider the two S. equations:

$$\frac{d^2\phi_1}{dx^2} + \frac{8\pi^2\mu}{h^2}E\,\phi_1 = 0 \tag{12}$$

and

$$\frac{d^2\phi_2}{dx^2} + \frac{8\pi^2\mu}{h^2}(E - U_0)\,\phi_2 = 0. \tag{13a}$$

The solution of (12) is, as before,

$$\phi_1 = A e^{i\alpha x} + B e^{-i\alpha x}. \tag{14}$$

In equation (13a) $U_0$ is greater than E and, hence, the momentum
and corresponding de Broglie wave length are imaginary. If we write
the equation in the form

$$\frac{d^2\phi_2}{dx^2} - \frac{8\pi^2\mu}{h^2}(U_0 - E)\,\phi_2 = 0, \tag{13b}$$

where $U_0 - E$ is a positive quantity, the general solution may be written,
by analogy with equation (#.20), as

$$\phi_2 = C e^{\beta x} + D e^{-\beta x}, \tag{15}$$

where

$$\beta^2 = \frac{8\pi^2\mu}{h^2}(U_0 - E). \tag{16}$$

Equation (15) represents the sum of a function $C e^{\beta x}$ which increases
exponentially with increase in x, and another function $D e^{-\beta x}$ which
decreases exponentially in the same region. Now purely physical
considerations show that if any electrons actually penetrate into region II,
as is indicated by the solution $\phi_2$, then the concentration of these
electrons must decrease rapidly with increase in x. Hence, we must
put C = 0.

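Applying the boundary conditions stated in equations (5) and (6),
we deduce the relations:

$$A + B = D,$$

$$i\alpha(A - B) = -\beta D.$$

That is,

$$A - B = \frac{i\beta}{\alpha}\,D.$$

Therefore,

$$A = \frac{D}{2}\left(1 + \frac{i\beta}{\alpha}\right), \quad B = \frac{D}{2}\left(1 - \frac{i\beta}{\alpha}\right).$$

That is, A and B are complex conjugate quantities.
Hence,

$$\phi_1 = D\cos\alpha x - D\,\frac{\beta}{\alpha}\sin\alpha x = D\sqrt{1 + \frac{\beta^2}{\alpha^2}}\;\cos(\alpha x + \delta),$$

where

$$\tan\delta = \frac{\beta}{\alpha}.$$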


The average concentration per unit length ($C_0$) in the region x < 0
is obtained thus:
Since the de Broglie wave length is given by the relation

$$\lambda = \frac{h}{\sqrt{2\mu E}} = \frac{2\pi}{\alpha},$$

$C_0$ is the average of $\phi_1^2$ over half a wave length.
Let $x = \theta/\alpha$. Then we can write $\cos^2\alpha x = \cos^2\theta$, and the limits of
integration are 0 and $\pi$. Since

$$\int_0^{\pi}\cos^2\theta\,d\theta = \frac{\pi}{2},$$

$$C_0 = \frac{D^2}{2}\left(1 + \frac{\beta^2}{\alpha^2}\right) = 2|A|^2 = 2|B|^2. \tag{18}$$

Equation (15), with C = 0 and D real, shows that there is a definite
probability for the occurrence of the electrons in the region $U_0 > E$.
The total probability of the occurrence of electrons in this region is

$$P = \int_0^{\infty}\phi_2^2\,dx = D^2\int_0^{\infty}e^{-2\beta x}\,dx = \frac{D^2}{2\beta}. \tag{19}$$

The ratio $P/C_0$ gives, as E. U. Condon 1 terms it, "a kind of mean
depth of penetration of the particles into the non-classical region." It
is evidently

$$\frac{P}{C_0} = \frac{\alpha^2}{\beta(\alpha^2 + \beta^2)}, \tag{20}$$

and varies from 0 for E = 0 to $\infty$ as E tends to become equal to $U_0$.
The particles, of course, do not stay in region II indefinitely but ultimately
return to region I. Also it is evident that with the increasing value of
the ratio $U_0/E$ (corresponding to increasing values of $\beta/\alpha$) the relative
probability of penetration, as defined by equation (20), decreases until,
for $\beta = \infty$, this probability becomes zero, that is, there is no penetration
of particles into the "forbidden" region.
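
The behavior of the ratio in equation (20) between these limits may be seen from a short numerical sketch. The scale `k0` is a quantity introduced here for convenience, not one used in the text; it stands for $(2\pi/h)\sqrt{2\mu U_0}$, so that $\alpha = k_0\sqrt{E/U_0}$ and $\beta = k_0\sqrt{1 - E/U_0}$.

```python
import math

def mean_penetration_depth(e_over_u0: float, k0: float = 1.0) -> float:
    """P/C_0 from equation (20), for 0 <= E/U_0 < 1.

    k0 = (2*pi/h)*sqrt(2*mu*U_0) is an assumed convenience scale, so the
    result is a length in units of 1/k0.
    """
    alpha = k0 * math.sqrt(e_over_u0)
    beta = k0 * math.sqrt(1.0 - e_over_u0)
    return alpha**2 / (beta * (alpha**2 + beta**2))

if __name__ == "__main__":
    for ratio in (0.0, 0.25, 0.5, 0.9, 0.99):
        print(f"E/U0 = {ratio:4.2f}  P/C0 = {mean_penetration_depth(ratio):.4f} (units of 1/k0)")
```

The depth is zero at E = 0 and grows without limit as E approaches $U_0$, in agreement with the statement above.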
In this case, since $\phi_1 = 0$ for x = 0, it follows that

$$\phi_1 = A\left(e^{i\alpha x} - e^{-i\alpha x}\right) = 2Ai\sin\alpha x.$$

That is, the eigenfunction represents two streams of electrons of equal
intensity but in opposite directions. Thus, in the case $E < U_0$, the result
obtained for R, the reflection coefficient, is the same as in classical
mechanics. There is, however, this difference. Whereas the classical
treatment states that penetration of electrons into the region $U_0 > E$ is
forbidden, quantum mechanics states that there exists a definite
probability for the occurrence of the phenomenon and that this probability
becomes vanishingly small as the ratio $\beta/\alpha$ tends to infinity.

1 E. U. Condon, Rev. Modern Phys., 3, 43 (1931).

FIG. 11. Form of eigenfunction for penetration in region $E < U_0$.

Figure 11 shows a plot of $\phi_1$ and $\phi_2$ for the case $U_0 = 2E$. These
curves were derived as follows:

$$\alpha x = \frac{2\pi x}{\lambda} = 2\pi r, \quad\text{and}\quad r = \frac{x}{\lambda},$$

where $\lambda$ is the de Broglie wave length in region I. Also

$$\tan\delta = \frac{\beta}{\alpha}.$$

Hence, in this case,

$$\beta = \alpha, \quad \tan\delta = 1.$$

Thus $\cos\delta = \sin\delta = 1/\sqrt{2}$; $\delta = 2\pi/8$, and the corresponding value of
r is 0.125.
It follows that

$$\phi_1 = D\sqrt{2}\cos 2\pi(r + 0.125),$$

while

$$\phi_2 = D e^{-2\pi r}.$$

For r = 0, $\phi_1 = \phi_2 = D$, and

$$\frac{d\phi_1}{dr} = -2\pi D = \frac{d\phi_2}{dr}.$$

It will also be observed that $C_0 = D^2$ is the concentration per unit
length in the region U = 0.

3.2 Electrons between Semi-Infinite Potential Barriers. A problem
which may be considered at this stage is that of an electron in a potential
"box." 2 This is illustrated in Fig. 12, for the case in which U = 0 for
the region x = -a to x = +a, and has the value $U_0 > E$ at all points
(extending to $-\infty$ and $+\infty$) outside this region. What will be the behavior
of an electron confined in such a region, which we assume to extend to
infinity along the y- and z-coordinates?

FIG. 12. Illustrating behavior of electron in potential "box."

The problem is of importance for at least two reasons. Firstly, the behavior
of an electron confined between two potential barriers, such as those
assumed in the present case, must present similarities to the behavior of
an electron bound by the electrostatic field around a positively charged
nucleus. Secondly,
the problem resembles, in its simplest aspects, that of electrons in a 
metal. In this case, as is well known, the kinetic energy of the electrons 
(E) begins to exceed the work function (F ) for emission, only at 
higher temperatures, and the increase of emission with temperature 
corresponds to an increase in the number of electrons in the metal which 
possess a kinetic energy greater than or equal to the work function. 

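In the case shown in Fig. 12 we have to consider three regions: (1)
that extending from x = -a to x = +a; (2) that from x = +a to
$x = \infty$; (3) that from x = -a to $x = -\infty$. Let us write

$$\alpha^2 = \frac{8\pi^2\mu E}{h^2}, \quad \beta^2 = \frac{8\pi^2\mu}{h^2}(U_0 - E),$$

where, as in the previous case, since $U_0 > E$, $\beta^2$ is a positive quantity.

2 The solution of this problem is given by J. Frenkel, "Einführung in die
Wellenmechanik," pp. 52-55; "Wave Mechanics," Vol. I.

The solutions of the S. equation are given by

$$\phi_{I} = A e^{i\alpha x} + B e^{-i\alpha x},$$

$$\phi_{II} = C e^{-\beta x},$$

which makes $\phi_{II}$ vanish for large positive values of x, and

$$\phi_{III} = D e^{\beta x},$$

which makes $\phi_{III}$ vanish for large negative values of x.
Applying the continuity relations for x = a, we have

$$\phi_{I} = \phi_{II}, \quad \frac{d\phi_{I}}{dx} = \frac{d\phi_{II}}{dx}.$$

From these relations and from similar relations for x = -a, we obtain
the four equations

$$A e^{i\alpha a} + B e^{-i\alpha a} = C e^{-\beta a} \tag{i}$$

$$A e^{-i\alpha a} + B e^{i\alpha a} = D e^{-\beta a} \tag{ii}$$

$$i\alpha\left(A e^{i\alpha a} - B e^{-i\alpha a}\right) = -\beta C e^{-\beta a} \tag{iii}$$

$$i\alpha\left(A e^{-i\alpha a} - B e^{i\alpha a}\right) = \beta D e^{-\beta a} \tag{iv}$$

which is a system of four linear homogeneous equations in the unknowns
A, B, C, and D.
From (i) and (ii) by addition,

$$2(A + B)\cos\alpha a = (C + D)e^{-\beta a}. \tag{v}$$

From (iii) and (iv) by subtraction,

$$2\alpha(A + B)\sin\alpha a = \beta(C + D)e^{-\beta a}. \tag{vi}$$

Hence,

$$\tan\alpha a = \frac{\beta}{\alpha}. \tag{21}$$

Again, from (i) and (ii) by subtraction,

$$2i(A - B)\sin\alpha a = (C - D)e^{-\beta a}, \tag{vii}$$

and from (iii) and (iv) by addition,

$$2i\alpha(A - B)\cos\alpha a = -\beta(C - D)e^{-\beta a}. \tag{viii}$$

Hence,

$$\tan\alpha a = -\frac{\alpha}{\beta}. \tag{22}$$
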
If (21) is valid, then it follows from (vii) and (viii) that

$$A = B; \quad C = D = 2A e^{\beta a}\cos\alpha a. \tag{23}$$

From (22) and (v) and (vi) it follows that

$$A = -B; \quad C = -D = 2Ai\,e^{\beta a}\sin\alpha a. \tag{24}$$

We thus obtain two sets of values of the constants, which satisfy the
boundary conditions of the problem. The corresponding eigenfunctions
in the case of equations (21) and (23) are

$$\phi_{I} = 2A\cos\alpha x \quad (-a \leq x \leq a)$$

$$\phi_{II} = 2A\cos\alpha a\; e^{-\beta(x - a)} \quad (x > a) \tag{25}$$

$$\phi_{III} = 2A\cos\alpha a\; e^{\beta(x + a)} \quad (x < -a)$$

While in the case of equations (22) and (24), the eigenfunctions are

$$\phi_{I} = 2iA\sin\alpha x$$

$$\phi_{II} = 2iA\sin\alpha a\; e^{-\beta(x - a)} \tag{26}$$

$$\phi_{III} = -2iA\sin\alpha a\; e^{\beta(x + a)}$$

Equations (21) and (25) thus represent one type of vibration, in which
for x = 0, $\phi_{I} = 2A$, and for $x = \pm a$, $\phi_{I} = 2A\cos\alpha a$, while (22) and (26)
represent another type of vibration in which, for x = 0, $\phi_{I} = 0$, and
for x = +a, $\phi_{I} = 2Ai\sin\alpha a$, while, for x = -a, $\phi_{I} = -2Ai\sin\alpha a$.
For the first type of vibration $\phi_{I}$ has the same value for +x and -x
(see Fig. 13) and is known as an even function, while for the second type
of vibration, $\phi_{I}$ is antisymmetrical with respect to a change from
+x to -x (see Fig. 14) and is known as an odd function. 3

FIG. 13. Symmetrical eigenfunctions for electron in "box."

FIG. 14. Antisymmetrical eigenfunctions for electron in "box."

3 Figure 14 represents, of course, the real function $2A\sin\alpha x$, which is the modulus
of the function $\phi_{I}$ in equation (26). That is, $\phi_{I}\bar{\phi}_{I} = (2iA\sin\alpha x)(-2iA\sin\alpha x) = 4|A|^2\sin^2\alpha x$.

It will be observed that in both these cases there is a definite
probability of penetration into regions for which $U_0 > E$. These
probabilities are determined by $\phi_{II}\bar{\phi}_{II}$ and $\phi_{III}\bar{\phi}_{III}$ as functions of x, which
are real magnitudes in both the symmetrical and antisymmetrical cases.
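
Conditions (21) and (22) fix the allowed values of $\alpha a$ only implicitly, but they are readily solved numerically. The following is a minimal sketch, not part of the original treatment: we write $z = \alpha a$ and $z_0 = a\sqrt{\alpha^2 + \beta^2}$ (names introduced here), so that $\beta a = \sqrt{z_0^2 - z^2}$, and locate the roots by bisection in each interval of width $\pi/2$.

```python
import math

def _bisect(f, lo, hi, tol=1e-12):
    """Plain bisection; assumes f(lo) and f(hi) have opposite signs."""
    f_lo = f(lo)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_lo < 0) == (f_mid < 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)

def well_roots(z0):
    """Return (even, odd) lists of z = alpha*a for a given z0 = a*sqrt(alpha^2 + beta^2)."""
    even, odd = [], []
    eps = 1e-9
    n = 0
    while n * math.pi / 2 < z0:
        lo = n * math.pi / 2 + eps
        hi = min((n + 1) * math.pi / 2 - eps, z0)
        if n % 2 == 0:
            # equation (21), tan z = beta/alpha, cleared of the tangent
            f = lambda z: z * math.sin(z) - math.sqrt(max(z0 * z0 - z * z, 0.0)) * math.cos(z)
            target = even
        else:
            # equation (22), tan z = -alpha/beta, cleared of the tangent
            f = lambda z: z * math.cos(z) + math.sqrt(max(z0 * z0 - z * z, 0.0)) * math.sin(z)
            target = odd
        if lo < hi and (f(lo) < 0) != (f(hi) < 0):
            target.append(_bisect(f, lo, hi))
        n += 1
    return even, odd

if __name__ == "__main__":
    even, odd = well_roots(4.0)
    print("even states, alpha*a:", even)
    print("odd states,  alpha*a:", odd)
```

Each root corresponds to an energy $E/U_0 = (z/z_0)^2$. For a very shallow box only one even state survives, while for very large $z_0$ the roots approach multiples of $\pi/2$.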

In the limit, for $\beta = \infty$, that is $U_0$ infinitely great, $e^{-2\beta x} = 0$, and
there is no penetration into the regions outside the "walls" at +a and
-a. Under these conditions equation (21) requires that

$$\tan\alpha a = \infty.$$

That is, $\alpha a$ must be equal to an odd multiple of $\pi/2$, and therefore

$$\alpha = \frac{(2n + 1)\pi}{2a},$$

where n = 0, 1, 2, etc.
Since

$$\alpha^2 = \frac{8\pi^2\mu E}{h^2},$$

it follows that

$$E = \left(\frac{m}{2a}\right)^2\frac{h^2}{8\mu}, \tag{27}$$

where m = 2n + 1 is an odd integer.

Similarly equation (22) requires that $\cot\alpha a = -\infty$ for $\beta = \infty$.
That is, $\alpha a$ must be equal to an even multiple of $\pi/2$, and therefore

$$\alpha = \frac{2n\pi}{2a}, \quad E = \left(\frac{m}{2a}\right)^2\frac{h^2}{8\mu}, \tag{28}$$

where n = 1, 2, etc., and m = 2n is an even integer.

Equations (27) and (28) signify that an electron between two potential
boundaries, $U_0 > E$, will behave in much the same manner as a
stretched string. The electron in the box cannot have a continuously
varying set of values for the kinetic energy E, but rather these energy
values will form a series of discrete values, which for $U_0$ infinitely large
compared with E is defined by the relation

$$E_m = \frac{m^2h^2}{8\mu d^2}, \tag{29}$$

where m = 1, 2, 3, etc., and d = 2a denotes the distance between the
two boundaries.
These are the eigenvalues corresponding to the eigenfunctions:

$$\phi_S = 2A\cos\frac{2\pi x}{\lambda}, \tag{30}$$

and

$$\phi_A = 2Ai\sin\frac{2\pi x}{\lambda}, \tag{31}$$

where

$$\lambda = \text{de Broglie wave length} = \frac{4a}{m} = \frac{2d}{m}. \tag{32}$$

Thus $\phi_S$ (the even function) corresponds to the odd values of m, while
$\phi_A$ (the odd function) corresponds to the even values of m. These
relations are evidently identical with those deduced previously for the
wave lengths of the vibrations of a stretched string.

Figures 13 and 14 show the symmetrical (or even) and antisymmetrical
(or odd) eigenfunctions, respectively, corresponding to values of m
ranging from 1 to 4. It will be observed that the number of nodes (that
is, points at which $\phi = 0$) is always equal to m - 1.

We shall now attempt to interpret the meaning of $\phi_S$ or $\phi_A$. According
to the point of view of quantum mechanics, $\phi\bar{\phi}\,dx$ is the probability of
occurrence of the particle or particles in an element dx at the point x. In
the present case

$$\phi_S\bar{\phi}_S = 4A^2\cos^2\frac{2\pi x}{\lambda},$$

and

$$\phi_A\bar{\phi}_A = 4A^2\sin^2\frac{2\pi x}{\lambda}.$$

These functions are designated as probability distribution functions,
and from their form it is evident that they are always real and positive. 4
Hence, the value of

$$\int_{-a}^{a}\phi\bar{\phi}\,dx$$

is real and positive.

4 Plots of these functions are given in Pauling and Wilson, "Introduction to
Quantum Mechanics," p. 97.

Now, if we wish to describe an experiment in which we know that
there is just one electron in the box, we must determine the value of A
to be such that the total probability of finding the electron in the region
x = -a to x = +a is unity. (This is, of course, applicable only to
the case in which $U_0/E = \infty$.) Hence,

$$\int_{-a}^{a}\phi_S\bar{\phi}_S\,dx = 1 = 4A^2\int_{-a}^{a}\cos^2\frac{2\pi x}{\lambda}\,dx = 4A^2\,\frac{\lambda}{2\pi}\int\cos^2\theta\,d\theta.$$

The limits x = -a and x = a are evidently identical with the limits
x = 0 and x = 2a, and these are identical with the limits $\theta = 2\pi x/\lambda = 0$
and $\theta = 4\pi a/\lambda = m\pi$, as shown by equation (32).

Since

$$\int_0^{m\pi}\cos^2\theta\,d\theta = \int_0^{m\pi}\sin^2\theta\,d\theta = \frac{m\pi}{2},$$

it follows that

$$4A^2\cdot\frac{\lambda}{2\pi}\cdot\frac{m\pi}{2} = A^2 m\lambda = 1,$$

and

$$2A = \frac{2}{\sqrt{m\lambda}} = \sqrt{\frac{2}{d}}. \tag{33}$$

The same result is obtained from (31) by considering the value of

$$\int_{-a}^{a}\phi_A\bar{\phi}_A\,dx.$$

This procedure, which consists in determining the value of the
coefficient 2A which will make the total probability equal to unity, is
known as normalization of the eigenfunction, and when equation (30)
is written in the form

$$\phi_S = \sqrt{\frac{2}{d}}\cos\frac{2\pi x}{\lambda}, \tag{34}$$

the right-hand side is said to be the normalized form of the function,
while $\sqrt{2/d}$ is known as the normalizing factor. In this equation, $\lambda$ varies
inversely as $\sqrt{E}$, according to equation (32), and both d and $\lambda$ could be
expressed in terms of the eigenvalue $E_m$, thus:

$$(\phi_S)_m = \left(\frac{2\sqrt{8\mu E_m}}{mh}\right)^{1/2}\cos\left(\frac{2\pi\sqrt{2\mu E_m}}{h}\,x\right), \tag{35}$$

where m = 1, 3, 5, etc. The normalized eigenfunctions $(\phi_A)_m$ will
have the same coefficients with m = 2, 4, 6, etc., and the sine replacing
the cosine functions.

As N. F. Mott has pointed out, 5 the eigenvalues $E_m$ are very close
together if d is of ordinary dimensions. But as d is decreased toward
atomic dimensions, the value of the lowest energy level $E_1$ increases
and the spread between levels also increases.

These conclusions are evident from a consideration of equation (29).
For an electron in a box for which d = 2 cm.,

$$E_m = \frac{m^2}{4}\cdot\frac{(6.55\times10^{-27})^2}{8\times9\times10^{-28}} = m^2\times1.49\times10^{-27}\ \text{erg}.$$

On the other hand, for $d = 2\times10^{-8}$ cm.,

$$E_m = \frac{m^2\times1.49\times10^{-11}}{1.591\times10^{-12}} = 9.15\,m^2\ \text{electron volts}$$

= 9.15 e.v. for m = 1
= 36.60 e.v. for m = 2, etc.

(The electron volt is the kinetic energy acquired by an electron when
accelerated through a potential difference of 1 volt.)
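
The dependence of the level spacing on d can be made concrete with a small calculation. The sketch below assumes the rounded constants quoted above (h = 6.55 × 10⁻²⁷ erg sec, μ = 9 × 10⁻²⁸ g, 1 e.v. = 1.591 × 10⁻¹² erg); the function names are ours.

```python
H = 6.55e-27            # Planck's constant, erg sec (value quoted in the text)
MU = 9.0e-28            # electron mass, g (value quoted in the text)
ERG_PER_EV = 1.591e-12  # erg per electron volt (value quoted in the text)

def box_energy_erg(m: int, d_cm: float) -> float:
    """Eigenvalue E_m of equation (29), in erg, for a box of width d in cm."""
    return m**2 * H**2 / (8.0 * MU * d_cm**2)

def box_energy_ev(m: int, d_cm: float) -> float:
    """The same eigenvalue converted to electron volts."""
    return box_energy_erg(m, d_cm) / ERG_PER_EV

if __name__ == "__main__":
    for d in (2.0, 2.0e-8):
        for m in (1, 2, 3):
            print(f"d = {d:.1e} cm  m = {m}  E = {box_energy_erg(m, d):.3e} erg"
                  f" = {box_energy_ev(m, d):.3e} e.v.")
```

For d = 2 cm the levels are of the order of 10⁻²⁷ erg and practically continuous; for d = 2 × 10⁻⁸ cm the lowest level lies between 9 and 9.5 e.v. with these constants, and successive levels increase as m².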

The energy for the excitation of the first energy level in atomic 
hydrogen is equivalent to 10.12 electron volts, which is of the same order 
of magnitude as the energy corresponding to m = 1 for an electron 
in the case $d = 2\times10^{-8}$ cm. As mentioned previously an electron
bound in a hydrogen atom by the field due to the nucleus resembles an 
electron between potential barriers. Thus, the solution of this latter 
problem accounts, qualitatively at least, for the experimental observa- 
tion that the electron in the hydrogen atom exhibits a series of discrete 
energy values. 

An interesting mathematical deduction from equations (30) and 
(31) should be mentioned in this connection. 

5 N. F. Mott, "An Outline of Wave Mechanics," p. 61.

Starting with the function

$$\phi_m = \cos\frac{2\pi x}{\lambda} = \cos\frac{m\pi x}{2a},$$

let us consider the integral

$$\int_0^{2a}\cos\frac{m_1\pi x}{2a}\cos\frac{m_2\pi x}{2a}\,dx,$$

where $m_1$ is not equal to $m_2$ ($m_1 \neq m_2$).
From trigonometrical formulas it follows that

$$\int_0^{2a}\cos\frac{m_1\pi x}{2a}\cos\frac{m_2\pi x}{2a}\,dx = \frac{1}{2}\int_0^{2a}\cos\left[(m_1 + m_2)\frac{\pi x}{2a}\right]dx + \frac{1}{2}\int_0^{2a}\cos\left[(m_1 - m_2)\frac{\pi x}{2a}\right]dx$$

$$= \frac{a}{\pi(m_1 + m_2)}\int_0^{2a}d\,\sin\left[(m_1 + m_2)\frac{\pi x}{2a}\right] + \frac{a}{\pi(m_1 - m_2)}\int_0^{2a}d\,\sin\left[(m_1 - m_2)\frac{\pi x}{2a}\right] = 0,$$

since $\sin(m_1 + m_2)\pi = \sin(m_1 - m_2)\pi = \sin 0 = 0$.

The eigenfunctions $\phi_{m_1}$ and $\phi_{m_2}$, where $m_1 \neq m_2$, are said to be
orthogonal to each other, and we can generalize this result by the
statement that the different eigenfunctions, corresponding to different values of
m, form an orthogonal set, that is

$$\int_0^{2a}\phi_{m_1}\bar{\phi}_{m_2}\,dx = 0 \quad (m_1 \neq m_2),$$

where the integration is carried out over the whole range in which the
functions have physical significance and the bar over the second function
indicates that when $\phi_{m_1}$ is a complex function, the conjugate complex
of the second function is to be used in the integration, since only in this
manner can the product be made to indicate a real value.
On the other hand, for any normalized eigenfunction, as shown already,

$$\int_0^{2a}\phi_m\bar{\phi}_m\,dx = 1.$$
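
Both statements, orthogonality for different m and normalization for equal m, can be verified numerically. The following sketch assumes the normalized eigenfunctions of the box with infinitely high walls, taken as real functions on $-d/2 \leq x \leq d/2$ (cosine for odd m, sine for even m, with the factor i dropped); the integration is a simple midpoint rule and the function names are ours.

```python
import math

def eigenfunction(m: int, x: float, d: float) -> float:
    """Normalized box eigenfunction on -d/2 <= x <= d/2 (cosine for odd m, sine for even m)."""
    amplitude = math.sqrt(2.0 / d)
    arg = m * math.pi * x / d
    return amplitude * (math.cos(arg) if m % 2 else math.sin(arg))

def overlap(m1: int, m2: int, d: float = 2.0, steps: int = 20000) -> float:
    """Midpoint-rule estimate of the integral of phi_m1 * phi_m2 over the box."""
    h = d / steps
    total = 0.0
    for k in range(steps):
        x = -d / 2 + (k + 0.5) * h
        total += eigenfunction(m1, x, d) * eigenfunction(m2, x, d)
    return total * h

if __name__ == "__main__":
    for m1 in range(1, 5):
        print([round(overlap(m1, m2), 6) for m2 in range(1, 5)])
```

The printed array is, to the accuracy of the integration, the unit matrix: unity on the diagonal and zero elsewhere.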



The trigonometric functions constitute the simplest types of orthogonal
functions. Thus, we have

$$\int_0^{2\pi}\cos mx\cos nx\,dx = \int_0^{2\pi}\sin mx\sin nx\,dx = \int_0^{2\pi}\sin mx\cos nx\,dx = 0 \quad\text{for } m \neq n, \tag{36}$$

whereas

$$\int_0^{2\pi}\cos^2 nx\,dx = \int_0^{2\pi}\sin^2 nx\,dx = \pi.$$

The set of functions

$$\frac{\sin x}{\sqrt{\pi}},\ \ldots,\ \frac{\sin nx}{\sqrt{\pi}},\ \frac{\cos x}{\sqrt{\pi}},\ \ldots,\ \frac{\cos nx}{\sqrt{\pi}} \tag{37}$$

form a normalized orthogonal (or orthonormal) set in the interval
0 to $2\pi$.

In a subsequent chapter it is shown that the different eigenfunctions
$\phi_1, \phi_2, \ldots \phi_m \ldots$, corresponding to the
eigenvalues $E_1, E_2, \ldots E_m \ldots$, which are
obtained as solutions of the S. equation for
any form of the function U(x), form an orthogonal
set. This is also true for the case in
which U and $\phi$ are each functions of two or
more coordinate variables.

3.3 Transmission of Electrons through Potential Barrier of Finite Extent.
The simplest case of a potential barrier of finite thickness is
that illustrated in Fig. 15, where U = 0 for
x < 0, $U = U_0$ for the region x = 0 to x = a, and U = 0 for x > a.

FIG. 15. Illustrating penetration of particle through rectangular potential barrier.

Case I. $E > U_0$.

We now write down the S. equation for each of the three regions and
the corresponding general solutions.

(Region I)

$$\frac{d^2\phi_{I}}{dx^2} + \alpha^2\phi_{I} = 0.$$

The corresponding solution is

$$\phi_{I} = A e^{i\alpha x} + B e^{-i\alpha x}.$$

(Region II)

$$\frac{d^2\phi_{II}}{dx^2} + \beta^2\phi_{II} = 0,$$

where

$$\beta^2 = \frac{8\pi^2\mu(E - U_0)}{h^2}.$$

Solution is

$$\phi_{II} = C e^{i\beta x} + D e^{-i\beta x}.$$

(Region III)

$$\frac{d^2\phi_{III}}{dx^2} + \alpha^2\phi_{III} = 0.$$

Solution is

$$\phi_{III} = F e^{i\alpha x},$$

since the beam of electrons is transmitted from region I through II and
III, and there are no electrons moving from right to left in region III.
Postulating as in the previous problems that, at x = 0,

$$\phi_{I} = \phi_{II}, \quad \frac{d\phi_{I}}{dx} = \frac{d\phi_{II}}{dx},$$

and at x = a,

$$\phi_{II} = \phi_{III}, \quad \frac{d\phi_{II}}{dx} = \frac{d\phi_{III}}{dx},$$

we derive the relations

$$A + B = C + D,$$
$$\alpha(A - B) = \beta(C - D),$$
$$C e^{i\beta a} + D e^{-i\beta a} = F e^{i\alpha a}, \tag{38}$$
$$\beta\left(C e^{i\beta a} - D e^{-i\beta a}\right) = \alpha F e^{i\alpha a}.$$

The number of electrons incident per unit time on the barrier at
x = 0 is

$$A\bar{A}\,v_0 = |A|^2 v_0,$$

where $|A|^2$ denotes the concentration per unit length of electrons
traveling in the direction of increasing values of x, and $v_0$ is the velocity,
defined by the relation $\mu v_0 = \sqrt{2\mu E} = h\alpha/(2\pi)$. (As mentioned
previously, the notation $|A|^2$ is used to designate the numerical value
of $A\bar{A}$.)

This incident beam is partly reflected at x = 0, and the rate at which
electrons are reflected is given by

$$|B|^2 v_0.$$

The rate at which the electrons are transmitted into the barrier is
given by

$$|C|^2 v,$$

where

$$\mu v = \frac{h\beta}{2\pi}.$$

This transmitted beam is partly reflected at x = a, and the rate of
reflection is

$$|D|^2 v,$$

while that of transmission into the region x > a is

$$|F|^2 v_0.$$

Thus, the net transmission coefficient for electrons incident on the
barrier at x = 0 is given by

$$\frac{|F|^2 v_0}{|A|^2 v_0} = \left|\frac{F}{A}\right|^2. \tag{39}$$

Since only ratios of constants occur in (39) we will put F equal to
unity, and hence the transmission coefficient for electrons through the
barrier is given by $|1/A|^2$. From relations (38) it follows that

$$A\bar{A} = 1 + \frac{(\alpha^2 - \beta^2)^2}{8\alpha^2\beta^2}\left[1 - \tfrac{1}{2}\left(e^{2i\beta a} + e^{-2i\beta a}\right)\right].$$

In this equation we may replace the exponential term by $\cos 2a\beta =
\cos(4\pi a/\lambda)$, where $\lambda$ is the de Broglie wave length inside the barrier,
corresponding to the momentum $\mu v = h\beta/2\pi$.

As E is increased more and more with respect to $U_0$, $\beta$ asymptotically
approaches $\alpha$, and $|A|^2 = 1$ in the limit. That is, there is complete
transmission of electrons through the barrier.
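
The approach to complete transmission may be followed numerically. The sketch below is an illustration under stated assumptions, not a formula taken verbatim from the text: solving the four boundary relations with F = 1 gives $A\bar{A} = 1 + \frac{1}{4}(\alpha/\beta - \beta/\alpha)^2\sin^2\beta a$, and we introduce a dimensionless width `k0_a` $= a\,(2\pi/h)\sqrt{2\mu U_0}$ so that $\alpha a = k_0a\sqrt{E/U_0}$ and $\beta a = k_0a\sqrt{E/U_0 - 1}$.

```python
import math

def barrier_transmission_above(e_over_u0: float, k0_a: float) -> float:
    """Transmission coefficient 1/|A|^2 for E > U_0 (requires e_over_u0 > 1).

    k0_a = a*(2*pi/h)*sqrt(2*mu*U_0) is an assumed dimensionless barrier width.
    """
    alpha_a = k0_a * math.sqrt(e_over_u0)
    beta_a = k0_a * math.sqrt(e_over_u0 - 1.0)
    mismatch = alpha_a / beta_a - beta_a / alpha_a
    a_sq = 1.0 + 0.25 * mismatch**2 * math.sin(beta_a) ** 2
    return 1.0 / a_sq

if __name__ == "__main__":
    for ratio in (1.1, 1.5, 2.0, 5.0, 50.0):
        print(f"E/U0 = {ratio:5.1f}  T = {barrier_transmission_above(ratio, 3.0):.4f}")
```

Besides tending to unity for large E, the transmission is also complete whenever $\beta a$ is a multiple of $\pi$, that is, when the barrier width contains a whole number of half wave lengths.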

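Case II. $E < U_0$.

According to classical mechanics, the particles incident at the barrier
are completely reflected. Quantum mechanics leads to a different
solution, which is of extremely great physical significance.
The solutions of the S. equation for this case will differ from those
given for case I in this respect, that $\beta$ corresponds to an imaginary
de Broglie wave length, and hence we have the three relations:

$$(x < 0)\qquad \phi_{I} = A e^{i\alpha x} + B e^{-i\alpha x},$$

$$(0 < x < a)\qquad \phi_{II} = C e^{\beta x} + D e^{-\beta x},$$

$$(x > a)\qquad \phi_{III} = e^{i\alpha x},$$

where the coefficient in the last relation is put equal to unity for the
same reason as in case I.

Putting in the conditions valid at x = 0 and x = a, we obtain the
following values for the four constants:

$$A = \frac{e^{i\alpha a}}{2}\left[\left(e^{\beta a} + e^{-\beta a}\right) + \frac{i}{2}\left(\frac{\beta}{\alpha} - \frac{\alpha}{\beta}\right)\left(e^{\beta a} - e^{-\beta a}\right)\right]; \tag{40}$$

$$B = -\frac{i\,e^{i\alpha a}}{4}\left(\frac{\beta}{\alpha} + \frac{\alpha}{\beta}\right)\left(e^{\beta a} - e^{-\beta a}\right); \tag{41}$$

$$C = \frac{e^{i\alpha a}}{2}\left(1 + \frac{i\alpha}{\beta}\right)e^{-\beta a}; \quad D = \frac{e^{i\alpha a}}{2}\left(1 - \frac{i\alpha}{\beta}\right)e^{\beta a}.$$

In these equations, $A\bar{A}v_0$ represents the "current" for the incident
particles, $B\bar{B}v_0$ that for the reflected particles, and $v_0$ that for the particles
transmitted through the barrier at x = a.

It can be shown very readily from equations (41) and (40) that

$$A\bar{A} = B\bar{B} + 1,$$

while the probability that a particle coming up to the boundary at
x = 0 shall "tunnel" through the barrier is

$$P(E/U_0) = \frac{1}{A\bar{A}},$$

where $P(E/U_0)$ designates that this probability is a function of the
ratio $E/U_0$.
From equation (40), it follows that

$$A = e^{i\alpha a}\left\{\cosh\beta a + \frac{i}{2}\left(\frac{\beta}{\alpha} - \frac{\alpha}{\beta}\right)\sinh\beta a\right\},$$

while

$$\bar{A} = e^{-i\alpha a}\left\{\cosh\beta a - \frac{i}{2}\left(\frac{\beta}{\alpha} - \frac{\alpha}{\beta}\right)\sinh\beta a\right\}.$$

Hence,

$$A\bar{A} = \cosh^2\beta a + \frac{1}{4}\left(\frac{\beta}{\alpha} - \frac{\alpha}{\beta}\right)^2\sinh^2\beta a.$$

For $U_0$ very much greater than E, $\beta/\alpha$ is very large, and $\cosh\beta a =
\sinh\beta a$. Hence we obtain the approximation

$$A\bar{A} = \frac{1}{4}\,\frac{\beta^2}{\alpha^2}\sinh^2\beta a = \frac{1}{16}\,\frac{\beta^2}{\alpha^2}\,e^{2\beta a},$$

and

$$P = 16\left(\frac{\alpha}{\beta}\right)^2 e^{-2\beta a}. \tag{42}$$
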

Except for values of $E/U_0$ which are near 1 or 0, the exponential
term is the important factor, and the equation indicates that except for
values of E small compared to those of $U_0$, there is a finite and
measurable probability that a particle will get through a potential barrier if
the latter has a width a which is of the same order of magnitude as

$$\frac{1}{\beta} = \frac{h}{2\pi\sqrt{2\mu(U_0 - E)}}.$$

Thus, let us consider an electron for which, if $(U_0 - E)$ is expressed
in electron volts V,

$$\frac{1}{\beta} = \frac{3.89\times10^{-8}}{\sqrt{V}}\ \text{cm}.$$

Assume E = 1, and $U_0 = 10$ in electron volts, then

$$\frac{1}{\beta} = 1.30\times10^{-8}\ \text{cm},$$

and we obtain the following values of P for different values of a:

  a × 10^8 (cm)    P
  1.30             0.248
  1.95             0.089
  2.60             0.035
  6.50             10^-4 approx.

Thus P decreases rapidly as a is increased, and, for values of a > 10 Å,
P becomes infinitesimally small; but for atomic dimensions, the
probability is evidently fairly large.
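
The rapid decrease of P with a is easily reproduced. The sketch below takes the quoted value $1/\beta = 1.30\times10^{-8}$ cm and the ratio $\beta/\alpha = \sqrt{(U_0 - E)/E} = 3$ as given, and evaluates both the expression $P = 1/A\bar{A}$ with $A\bar{A} = \cosh^2\beta a + \frac{1}{4}(\beta/\alpha - \alpha/\beta)^2\sinh^2\beta a$ and the thick-barrier approximation $P = 16(\alpha/\beta)^2e^{-2\beta a}$. The function names are ours.

```python
import math

def tunnel_probability_exact(beta_a: float, beta_over_alpha: float) -> float:
    """P = 1/(A A-bar), with A A-bar = cosh^2(beta a) + (1/4)(beta/alpha - alpha/beta)^2 sinh^2(beta a)."""
    diff = beta_over_alpha - 1.0 / beta_over_alpha
    a_sq = math.cosh(beta_a) ** 2 + 0.25 * diff**2 * math.sinh(beta_a) ** 2
    return 1.0 / a_sq

def tunnel_probability_approx(beta_a: float, beta_over_alpha: float) -> float:
    """Thick-barrier approximation P = 16 (alpha/beta)^2 exp(-2 beta a)."""
    return 16.0 / beta_over_alpha**2 * math.exp(-2.0 * beta_a)

if __name__ == "__main__":
    inv_beta = 1.30e-8                      # cm, value quoted in the text
    ratio = math.sqrt((10.0 - 1.0) / 1.0)   # beta/alpha for E = 1, U_0 = 10 e.v.
    for a in (1.30e-8, 1.95e-8, 2.60e-8, 6.50e-8):
        x = a / inv_beta
        print(f"a = {a:.2e} cm  exact = {tunnel_probability_exact(x, ratio):.3g}"
              f"  approx = {tunnel_probability_approx(x, ratio):.3g}")
```

The approximate values fall close to those tabulated above, and both columns show the same exponential decrease; the two expressions agree more closely as $\beta/\alpha$ and $\beta a$ become large.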

This phenomenon of the penetration of particles through potential 
barriers, or the " tunneling effect," as it has been designated, is one of 
the most important deductions contributed by the new quantum 
mechanics. R. W. Gurney and E. U. Condon 6 have described the 
significance of this conclusion as follows: 

In classical mechanics the orbit of a particle is entirely confined to those points in 
space at which its potential energy is less than its total energy. This is not true in 
quantum mechanics. Classically if a particle be moving in a basin of low potential 
energy and have not as much total energy as the maximum of potential energy 
surrounding the basin, it must certainly remain there for all time, unless it acquires 
the deficiency in energy somehow. But in quantum mechanics most statements of 
certainty are replaced by statements of probability. And the above statement 
must now be altered to read ". . .it may remain there for a long time but as time 
goes on, the probability that it has escaped, even without change in its total energy, 
increases towards unity . . . ." 

For instance, let us consider a particle of mass M and the total energy
E, moving in a range where the potential energy function V(x) is of
the form shown in Fig. 16. If E is less than the maximum value for
the height of the "hill" between the two "valleys," then according to
classical mechanics there would be two different types of motion possible
for the particle, each of which is confined to one of the regions I or II.
This motion would be of the nature of a vibration, with different
frequencies in each region.

FIG. 16. Illustrating penetration ...

But from the new point of view, there is no longer a definite correlation
between simultaneous values of position and momentum as implied by
the equation

$$\frac{p^2}{2M} + V(x) = E.$$

Instead of regarding the particle as moving with definite velocity at
any given point, we forget about the particle and consider, instead, the
properties of the associated wave motion and its "amplitude" function
$\phi$, which is a function of both x and E. The square of this function, or
rather $\phi\bar{\phi}\,dx$, is interpreted as giving the probability that the particle
lies between x and x + dx, when it is in the state of energy E. "This,"
as Gurney and Condon point out, "is really the ground for requiring
that $\phi$ remain finite. For an energy level, such that $\phi(E, x)$ does not
remain finite as $x \to \pm\infty$, the probability that it is not at infinity is
vanishingly small, and therefore these states do not exist physically.
Adopting the probability interpretation of $\phi(E, x)$ one has at once the
result that there is a finite probability of being outside the range of
classical motion of that energy."

6 R. W. Gurney and E. U. Condon, Phys. Rev., 33, 127 (1929).

The point of view thus emphasized in the remarks from the paper 
of Gurney and Condon and illustrated by the problems which have been 
solved in the previous section has received a number of important 
applications. As a preliminary to the consideration of these let us 
consider the following problem. 

For the electrons confined between two semi-infinite barriers, it was
shown that, for values of E not too small compared with those of $U_0$,
there is a definite probability of penetration into the regions $U_0 > E$.
If now these latter regions are decreased in extent until they become of
atomic dimensions, the probability becomes measurable that the
electrons which have entered the barriers will also pass completely through
them, and this probability will assume a definite value for each discrete
energy state $E_m$ of the electrons in the "box." To indicate this we
may denote the probability by $P(E_m)$.

Without attempting to solve this problem in detail, it is readily seen
that, as in Fig. 12, there will exist a series of discrete energy states,
designated by $E_1, E_2 \ldots E_m$, and that the corresponding eigenfunctions
$\phi(E_m, x)$ will be of the nature of sine and cosine functions of the angle
$\alpha x$, where

$$\alpha = \frac{2\pi}{h}\sqrt{2\mu E_m}.$$

Furthermore, it is evident that, if b denote the width of the region of
atomic dimensions, for which U > E, the probability of penetration
will be given, according to equation (42), by a relation of the form

$$P(E_m) = F e^{-2\beta b}, \tag{43}$$

where

$$2\beta b = \frac{4\pi b}{h}\sqrt{2\mu(U - E)}, \tag{44}$$

and F is a function of the ratio $E_m/U$. Except for values of this ratio
approaching zero or unity, the controlling factor governing the value of
P will be the exponential term.

For those cases in which the value of U in the region U > E is a
function of x, denoted by U(x), the value of the exponent $2\beta b$ is given,
approximately, by the expression

$$2\beta b = \frac{4\pi}{h}\int\sqrt{2\mu(U - E)}\,dx, \tag{45}$$

where the integration extends across the barrier between the two limits
at which U(x) - E = 0. It is readily seen that for $U(x) = U_0$ in
the range x = a to x = (a + b), and U(x) = 0 in the other
regions, this integral gives the result stated in (44).
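
For a barrier of arbitrary shape the integral in (45) is conveniently evaluated numerically. The following is a minimal sketch using the midpoint rule; the function name and arguments are ours, and the integrand is simply set to zero wherever $U(x) \leq E$, so that the limits need only enclose the barrier.

```python
import math

def barrier_exponent(potential, energy, x_lo, x_hi, mu, h, steps=10000):
    """Midpoint-rule estimate of (4*pi/h) * integral of sqrt(2*mu*(U(x) - E)) dx, equation (45)."""
    dx = (x_hi - x_lo) / steps
    total = 0.0
    for k in range(steps):
        x = x_lo + (k + 0.5) * dx
        excess = potential(x) - energy
        if excess > 0.0:                 # only the region U > E contributes
            total += math.sqrt(2.0 * mu * excess)
    return 4.0 * math.pi / h * total * dx

if __name__ == "__main__":
    # Rectangular barrier of height 3 between x = 1 and x = 3, in units with mu = h = 1.
    rectangular = lambda x: 3.0 if 1.0 <= x <= 3.0 else 0.0
    numeric = barrier_exponent(rectangular, 1.0, 0.0, 5.0, mu=1.0, h=1.0)
    closed_form = 4.0 * math.pi * 2.0 * math.sqrt(2.0 * 1.0 * (3.0 - 1.0))   # equation (44)
    print(numeric, closed_form)
```

For the rectangular barrier the numerical value coincides with the closed form (44); the same function may be applied to any other form of U(x) supplied as a Python function.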

3.4 Application to Radioactive Disintegration. As first shown by 
R. W. Gurney and E. U. Condon 7 and, independently, by G. Gamow, 8 
the theory of the penetration of particles through potential barriers 
of atomic dimensions gives a very satisfactory interpretation of the 
phenomena of ejection of alpha and beta particles from nuclei of the 
radioactive elements. 

As is well known this emission of charged corpuscles occurs
spontaneously and leads to the formation of a new atomic species. The rate
of disintegration of any nuclear type is given by

$$-dN = N\gamma\,dt,$$

where N is the number of nuclei which are unaltered at the end of any
given period t, and $\gamma$ is known as the decay constant. This leads to
the exponential law of decay

$$N(t) = N_0 e^{-\gamma t},$$

where $N_0$ is the number present at t = 0. It is customary to put
$\gamma = 1/\tau$ where $\tau$ is known as the mean "life" of the nucleus.
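
As a brief illustration of the decay law, the sketch below evaluates the number of surviving nuclei; the function names and the numerical values are arbitrary choices made here for the example.

```python
import math

def remaining_nuclei(n0: float, gamma: float, t: float) -> float:
    """N(t) = N_0 * exp(-gamma * t), the exponential law of decay."""
    return n0 * math.exp(-gamma * t)

def mean_life(gamma: float) -> float:
    """Mean life tau = 1/gamma."""
    return 1.0 / gamma

if __name__ == "__main__":
    gamma = 0.5                      # arbitrary decay constant, per unit time
    tau = mean_life(gamma)
    for t in (0.0, tau, 2.0 * tau, 5.0 * tau):
        print(f"t = {t:5.2f}  N = {remaining_nuclei(1000.0, gamma, t):8.2f}")
```

After one mean life the number of unaltered nuclei has fallen to 1/e of its initial value.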

For any given nuclear type, the alpha particles emitted are all of the
same velocity v, or at the most possess two or three discrete values of v.
On the other hand, the velocities of the beta particles form a continuous
range of values. For the emission of alpha particles we have the
empirical relation, known as the Geiger-Nuttall law, according to which

$$\log\gamma = A + B\log v,$$

where A and B are constants. That is, the life of a nuclear structure is
shorter, the higher the velocity of the emitted alpha particles.

7 R. W. Gurney and E. U. Condon, loc. cit.

8 G. Gamow, Z. Physik, 51, 204 (1928).

In order to account for these observations we have to consider the
nature of the potential energy function for an alpha particle in the
neighborhood of the nucleus. From experiments on the scattering of
alpha particles it is known that the inverse square law of repulsive
forces applies to within a distance of about $10^{-12}$ cm. from the nucleus.
If we denote the charge on the nucleus by +Ze and that on the alpha
particle by +2e, the force, as a function of the distance r, is given by
$2Ze^2/r^2$, and, therefore, as will be shown in Chapter IV,

$$U(r) = \int_r^{\infty}\frac{2Ze^2}{r^2}\,dr = \frac{2Ze^2}{r}.$$

This is indicated by the hyperbolic portion of the curve in Fig. 17.

However, in order to account for the fact that alpha particles remain
within the nuclear structure for a definite period of the order $\tau$, it is
necessary to assume that very close to the nucleus the field changes from
repulsive to attractive. Therefore, the function U will be represented
very close to the nucleus by the dotted portion in the figure. That is,
the nucleus is surrounded by a potential barrier of maximum height $U_0$,
and if the energy of the emitted alpha particle be represented by $E < U_0$,
then according to the arguments which have been given in the previous
section, there exists a definite probability for the penetration of the
particles through the barrier. The following remarks on the calculation
of this probability follow closely the presentation of Gurney and Condon.

FIG. 17. Potential energy curve for alpha particle in nucleus of radioactive atom.

Evidently the probability of emission of an alpha particle will be
equal to the reciprocal of the mean time $\tau$, during which a particle
remains in the region (I) close to the nucleus before "leaking through"
to the outer region (II) where it experiences a repulsive field. For a
particle of kinetic energy E, the velocity for U = 0 is $v = \sqrt{2E/\mu}$.
Hence, the amount of time spent by the particle in unit distance for x 
very large is vV/(2J57), and the time spent in a range of length a is 
therefore aVn/(2E). 
Now according to quantum mechanics, as shown in equation (43),
the probability of penetration is of the order

$$P = e^{-2\beta b},$$

where $P$ is defined as the ratio between the probability of occurrence of
the particle in unit length of the region II and the probability of
occurrence in unit length of the region I. "Therefore since the motion is
aperiodic and the particle escaping from the range I will in the mean
only go through unit length of II once, the time $\tau$ which must be spent
in range I before getting through to range II is of the order

$$\tau = a\sqrt{\frac{\mu}{2E}}\,e^{2\beta b},$$

where $2\beta b$ is defined by equation (45) and $a$ is of the order of the breadth
of range I."
Hence, the decay constant is given by the relation

$$\gamma = \frac{1}{\tau} = \frac{1}{a}\sqrt{\frac{2E}{\mu}}\,e^{-2\beta b}, \tag{46}$$

where the value of $2\beta b$ depends upon the distance between the maximum
value of $U$ and the horizontal line corresponding to the particular value
of $E$ for the emitted alpha particles. It is evident from this conclusion
"that if the size of the potential barrier be increased by a small 
factor the probability of escape may be decreased more than a million 
fold." 

To test this theory, Gurney and Condon have calculated the relative
lives of Ur, RaA, and RaC' from the observed values for the kinetic
energies of the emitted alpha particles, which are $6.5 \times 10^{-6}$,
$9.55 \times 10^{-6}$, and $12.2 \times 10^{-6}$ erg, respectively. These values of $E$ are
indicated by the horizontal lines in Fig. 17, while the potential energy
function for an alpha particle with respect to the nucleus is indicated
by the curve. From the areas of the portions between this curve and
the horizontal lines, values of the exponential factor in equation (46)
were calculated, and these were then used to calculate relative values of
$\gamma$, the decay constant. The agreement between these and the observed
relative values was found to be very satisfactory. Thus, "it is the
energy of the emitted alpha particle which determines its own rate of
decay," as had been concluded previously from the Geiger-Nuttall
relation.
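
The procedure just described can be imitated numerically. The following is only a rough sketch under stated assumptions, not a reproduction of Gurney and Condon's calculation: the barrier is taken as a pure Coulomb curve $U = 2Ze^2/r$ cut off at an inner radius `r0` of $10^{-12}$ cm (suggested by the scattering result quoted above), the breadth $a$ of range I is set equal to `r0`, the charge numbers of the residual nuclei (90, 82, 82) and the physical constants are modern values not given in the text, and the function names are hypothetical.

```python
import math

# CGS constants (modern values; assumptions, not taken from the text)
H = 6.626e-27          # Planck constant, erg s
E_CHARGE = 4.803e-10   # electronic charge, esu
MU_ALPHA = 6.644e-24   # mass of the alpha particle, g


def barrier_exponent(Z, E, r0, n=20000):
    """2*beta*b = (4*pi/h) * integral of sqrt(2*mu*(U - E)) dr taken over
    r0 < r < r1, where U = 2*Z*e^2/r and r1 is the point at which U = E."""
    B = 2.0 * Z * E_CHARGE ** 2
    r1 = B / E
    dr = (r1 - r0) / n
    total = 0.0
    for i in range(n):
        r = r0 + (i + 0.5) * dr          # midpoint rule
        total += math.sqrt(2.0 * MU_ALPHA * (B / r - E)) * dr
    return 4.0 * math.pi / H * total


def decay_constant(Z, E, r0):
    """Equation (46), with the breadth a of range I taken equal to r0."""
    velocity = math.sqrt(2.0 * E / MU_ALPHA)
    return velocity / r0 * math.exp(-barrier_exponent(Z, E, r0))


if __name__ == "__main__":
    # Energies in erg as quoted in the text; Z values are assumptions.
    for name, Z, E in [("Ur", 90, 6.5e-6), ("RaA", 82, 9.55e-6), ("RaC'", 82, 12.2e-6)]:
        print(name, decay_constant(Z, E, 1.0e-12))
```

With these assumptions the computed decay constants increase very steeply with the energy of the alpha particle, which is the qualitative content of the Geiger-Nuttall relation; the absolute values depend strongly on the assumed `r0`.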

3.5 Emission of Electrons in Presence of Intense Fields. It has 
been shown by a number of investigators that, with extremely high 
electric fields, electron emission from cathodes occurs even at room
temperature. This is known as the "cold-cathode" or "auto-electronic"
effect. An explanation of the phenomenon on the basis of the
quantum mechanics has been given by J. R. Oppenheimer 9 and by 
R. H. Fowler and L. Nordheim. 10 

As mentioned previously electron emission occurs from a cathode
when the kinetic energy is equal to or exceeds the work function. With
ordinary electric fields the potential energy function for the electrons
resembles that shown in Fig. 9, and the kinetic energy becomes sufficiently
high to overcome the barrier only at higher temperatures. But in the
presence of extremely high electric fields, even at ordinary temperatures,
the potential energy function is represented by the plot in Fig. 18 where

$$U(x) = 0 \quad \text{for } x < 0,$$
$$U(x) = C - Fx \quad \text{for } x \geq 0,$$

and $C$ denotes the work function, that is, the value of the potential
energy at $x = 0$, while $F$ is the field strength (corresponding to the
voltage gradient).

FIG. 18. Illustrating "cold-cathode" emission of electrons.

The S. equation which is to be solved in this case is of the form 

(47) 



where E, the kinetic energy of the electrons, is less than C. This is a 
differential equation which differs from those solved in the previous 
sections in that the coefficient in the second term on the left-hand side 
is not a constant, but a linear function of x. However, it is evident 
from the considerations advanced already that if the distance a, indi- 
cated in Fig. 18 as the width of the barrier, is of atomic dimensions, 
there will exist a probability of significant magnitude for the penetration 
of electrons through the barrier. 
From equation (45) it follows that this probability will be of the form 

$$P(E) = Ke^{-2\beta b}, \tag{48}$$

9 J. R. Oppenheimer, Phys. Rev., 31, 66 (1928). 

10 R. H. Fowler and L. Nordheim, Proc. Roy. Soc., A119, 173 (1928); L. Nordheim, 
Physik. Z., 30, 177 (1929). The investigations on this topic have been reviewed by
S. Dushman, Rev. Modern Phys., 2, 381 (1930). 
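where $K$ is a function of $E/C$, and

$$2\beta b = \frac{4\pi}{h}\int_0^a \sqrt{2\mu(U - E)}\,dx = \frac{4\pi}{h}\int_0^a \sqrt{2\mu(C - Fx - E)}\,dx.$$

Let $C - E = A$. Then

$$\int_0^a (A - Fx)^{1/2}\,dx = -\frac{1}{F}\int_0^a (A - Fx)^{1/2}\,d(A - Fx) = -\frac{2}{3F}\Big[(A - Fx)^{3/2}\Big]_0^a.$$

For $x = a$, $Fa = C - E = A$. Therefore,

$$\int_0^a (A - Fx)^{1/2}\,dx = \frac{2}{3F}A^{3/2},$$

and

$$2\beta b = \frac{8\pi\sqrt{2\mu}}{3hF}\,(C - E)^{3/2}. \tag{49}$$

It will be observed that the constant

$$\frac{1}{F} = \frac{a}{C - E}$$

takes the place of $a$, the extent of the barrier, which is the physical
significance of $b$ in the exponential term $e^{-2\beta b}$.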

According to Fowler and Nordheim, the coefficient K in (48) is 
given " with sufficient accuracy " by the relation 

$$K = \frac{4\,[E(C - E)]^{1/2}}{C},$$



so that the resulting expression for P(E) in (48) is very similar to that 
given in (42) for the probability of penetration where U > E is constant 
over the extent of the barrier. 

From these results, it is possible to derive a relation between the 
electron emission and the field strength which is in agreement with the 
experimental observations. 
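
As a check on the exponent in equation (49), the integral over the triangular barrier $U = C - Fx$ may be evaluated numerically and compared with the closed form. The sketch below makes several assumptions that are not in the text: CGS units with modern values of $h$ and the electron mass, $F$ expressed as a force in erg per cm, illustrative values of $C$, $E$, and the field, and hypothetical function names.

```python
import math

H = 6.626e-27            # Planck constant, erg s (assumed modern value)
MU_E = 9.109e-28         # electron mass, g (assumed modern value)
ERG_PER_EV = 1.602e-12   # conversion factor, erg per electron-volt


def exponent_closed_form(C, E, F):
    """Equation (49): 2*beta*b = 8*pi*sqrt(2*mu)*(C - E)**1.5 / (3*h*F)."""
    return 8.0 * math.pi * math.sqrt(2.0 * MU_E) * (C - E) ** 1.5 / (3.0 * H * F)


def exponent_numeric(C, E, F, n=100000):
    """(4*pi/h) * integral from 0 to a of sqrt(2*mu*(C - F*x - E)) dx,
    with a = (C - E)/F, evaluated by the midpoint rule."""
    a = (C - E) / F
    dx = a / n
    total = sum(math.sqrt(2.0 * MU_E * (C - F * (i + 0.5) * dx - E))
                for i in range(n)) * dx
    return 4.0 * math.pi / H * total


if __name__ == "__main__":
    C = 10.0 * ERG_PER_EV    # height of the barrier at x = 0 (illustrative)
    E = 5.5 * ERG_PER_EV     # kinetic energy of the electron (illustrative)
    F = 3.0e7 * ERG_PER_EV   # force for a field of 3e7 volt/cm, erg per cm
    print(exponent_closed_form(C, E, F), exponent_numeric(C, E, F))
```

Because the exponent varies as $1/F$, doubling the field halves $2\beta b$, and the penetration probability therefore rises very rapidly with the field strength.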
The theory of the penetration of electrons through potential barriers
has also been applied by Fowler and Nordheim to account for the
observations on the increase in emission from tungsten when covered by a
monatomic film of a more electropositive metal such as thorium. 11 In
this case the potential energy function is of the form shown in Fig. 19,
where $W_b$ denotes the work function for the pure metallic surface, and
$W_a$ denotes the work function when a monatomic film of thickness $l$ covers
the surface. The kinetic energy of the electrons in the metal is denoted
by $W$, and it is to be expected that for values of $W$ between those of
$W_a$ and $W_b$ there will exist a probability of transmission of electrons
through the film, which is of the form



where $\beta = \dfrac{2\pi}{h}\sqrt{2\mu(W_b - W)}$.

FIG. 19. Potential distribution at surface of metal covered with monatomic
film of electropositive element.

According to L. Nordheim and R. H. Fowler, 12 



where $k$ = Boltzmann constant = $1.37 \times 10^{-16}$ erg/deg., and $T$ =
absolute temperature.

In this connection the reader will find it of interest to study the paper 
by C. Eckart 13 in which the potential function V(x) is represented by an 
analytic function of the form 

V( ^- A ~* 

V (X) _ ax ,. _ 

where $a = 2\pi/d$, and the "barrier" is represented by a curve which
extends from $x = -d$ to $x = +d$, with the values $V = 0$ at the first
point and $V$ = constant, for $x \geq d$.

The resulting S. equation is of the hypergeometric type and the 
solutions may therefore be expressed in terms of the hypergeometric 
series. 14 

11 See Dushman, loc. cit., for a detailed discussion of the observations and their
interpretation on the basis of quantum mechanics. 

12 R. H. Fowler, Proc. Roy. Soc., A122, 36 (1929). 
13 C. Eckart, Phys. Rev., 35, 1303 (1930).

14 These series are discussed in most of the mathematical treatises mentioned at 
the end of Chapter II. 
CHAPTER IV
CLASSICAL THEORY OF ATOMIC DYNAMICS 1 

4.1 Relation between Quantum Mechanics and Classical Dynamics. 

Mention has been made in previous chapters of the Hamiltonian form of 
expression for the total energy of a system. The formulation of the S. 
equation for any given system involves, as a first step, the description 
of the system in the Hamiltonian form, and it is therefore necessary to 
know how to deduce the latter for any type of system involving two or 
more particles and two or more coordinate variables. Furthermore, 
we shall find that a number of the concepts, such as the Principle of 
Least Action, which are of fundamental significance in classical 
dynamics, have received an extended application in the new quantum 
mechanics. The Hamiltonian canonical equations find their analog 
in the Poisson bracket type of relations which Dirac has utilized in his 
technic of operators, and in numerous other applications of the new 
quantum mechanics we find illustrations of the extension or modification 
of conceptions which were developed in connection with classical 
dynamics. 

It is for this reason, and because we believe that the new ideas cannot 
be grasped fully except in the light of the older developments, that this 
chapter has been written. The important fact that must be realized
is that the new quantum mechanics represents a type of evolution from
the Newtonian dynamics: a development which is a logical consequence
of the Principle of Indeterminacy, that is, of the fact that whereas the
classical methods were intended to deal with macroscopic phenomena,
the newer modifications are needed in order to deal adequately with
atomic phenomena.

4.2 Kinetic and Potential Energy. The laws of classical dynamics 
are based upon Newton's three laws of motion, of which the second is 
the most important since it gives us a measure of force. 

Let $x$ denote the distance traversed by the particle of mass $\mu$ at time
$t$, under action of a force $F$, and let $v$ denote the velocity at this
instant.

Then, according to Newton's second law, the force is defined as the 

1 This chapter is based (with modifications at different places) on the author's 
discussion of this topic in Taylor's " Treatise on Physical Chemistry," pp. 1264-1280. 
rate of change of momentum, that is

$$F = \frac{d(\mu v)}{dt}. \tag{1}$$

In non-relativity mechanics, $\mu$ is regarded as constant (independent of
$v$), and therefore,

$$F = \mu\frac{dv}{dt} = \mu\ddot{x},$$

where $\ddot{x}$ is known as the acceleration.
We can write this equation in the form

$$F\,dx = \mu\frac{dv}{dt}\,v\,dt = \frac{1}{2}\mu\,d(v^2).$$

If $v_0$ and $v_1$ denote the values of the velocity at $x_0$ and $x_1$, respectively,

$$\frac{1}{2}\mu v_1^2 - \frac{1}{2}\mu v_0^2 = \int_{x_0}^{x_1} F\,dx. \tag{2}$$

The integral on the right-hand side denotes the work which is done 
upon the particle by the force F, and the left-hand side denotes the change 
in kinetic energy T. Thus, equation (2) signifies that the work done 
upon the particle is equal to the increase in kinetic energy of the latter. 
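
The statement of equation (2) may be verified numerically for any force law. The sketch below is illustrative only: the restoring force $F = -kx$, the numerical values, and the function name are assumptions introduced for the example, and the motion is advanced by a standard velocity-Verlet step rather than by any method given in the text.

```python
def work_and_kinetic_change(force, mu, x0, v0, t_end, n=20000):
    """Integrate mu * dv/dt = F(x) in small time steps, accumulating the
    work integral of F dx, and return (work done, change in kinetic energy)."""
    dt = t_end / n
    x, v, work = x0, v0, 0.0
    for _ in range(n):
        a_old = force(x) / mu
        x_new = x + v * dt + 0.5 * a_old * dt * dt      # velocity-Verlet position
        a_new = force(x_new) / mu
        v_new = v + 0.5 * (a_old + a_new) * dt          # velocity-Verlet velocity
        work += 0.5 * (force(x) + force(x_new)) * (x_new - x)   # trapezoidal F dx
        x, v = x_new, v_new
    return work, 0.5 * mu * v * v - 0.5 * mu * v0 * v0


if __name__ == "__main__":
    # Hypothetical example: F = -k x with k = 2, mass 1.5, released from rest at x = 1.
    work, delta_T = work_and_kinetic_change(lambda x: -2.0 * x, 1.5, 1.0, 0.0, 1.0)
    print(work, delta_T)
```

The two returned numbers should agree to within the error of the step-by-step integration.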
In general the system under consideration will consist of $n$ particles
moving under the action of impressed forces in three-dimensional space.
Applying equation (1) to each of these particles we obtain the $n$ sets of
relations

$$\begin{aligned}
\mu_i\ddot{x}_i - X_i &= 0,\\
\mu_i\ddot{y}_i - Y_i &= 0,\\
\mu_i\ddot{z}_i - Z_i &= 0,
\end{aligned} \tag{3}$$

where $\mu_i$ is the mass of the $i$th particle, and $X_i$, $Y_i$, and $Z_i$ are the three
components of the force acting on this particle.
The kinetic energy of the system is given by the relation

$$T = \sum_{i=1}^{n}\tfrac{1}{2}\mu_i\left(\dot{x}_i^2 + \dot{y}_i^2 + \dot{z}_i^2\right), \tag{4}$$

where $\dot{x}_i = dx_i/dt$, and the summation is taken over the $n$ particles.
It follows that

$$\frac{d}{dt}\left(\frac{\partial T}{\partial\dot{x}_i}\right) = \mu_i\ddot{x}_i, \tag{5}$$

and similar equations apply to $y_i$ and $z_i$.
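In the case of conservative systems (which is the only type that will be
considered in this chapter), it is possible to determine a function of the
$3n$ coordinates $x_1, y_1, z_1, \ldots, x_n, y_n, z_n$, designated by $V$, which possesses
the property that the force components are equal to the negative partial
derivatives of $V$ with respect to the corresponding coordinates. That is

$$X_i = -\frac{\partial V}{\partial x_i}, \qquad Y_i = -\frac{\partial V}{\partial y_i}, \qquad Z_i = -\frac{\partial V}{\partial z_i}, \tag{6}$$

where $i = 1, 2, \ldots, n$. We can write this set of relations in the form

$$\sum_{i=1}^{n}\left(X_i\,dx_i + Y_i\,dy_i + Z_i\,dz_i\right) = -dV, \tag{7}$$

thus indicating that the left-hand side of the last equation is an exact
differential.

From equations (6) and (5) it follows that we can express Newton's
second law in the form of the following equations, of which there are
$3n$ for the $n$ particles:

$$\begin{aligned}
\frac{d}{dt}\left(\frac{\partial T}{\partial\dot{x}_i}\right) &= -\frac{\partial V}{\partial x_i},\\
\frac{d}{dt}\left(\frac{\partial T}{\partial\dot{y}_i}\right) &= -\frac{\partial V}{\partial y_i},\\
\frac{d}{dt}\left(\frac{\partial T}{\partial\dot{z}_i}\right) &= -\frac{\partial V}{\partial z_i}.
\end{aligned} \tag{8}$$

Now let us consider the set of equations (3). Multiplying these
equations respectively by the arbitrary small displacements (footnote 2)
$\delta x_i$, $\delta y_i$, and $\delta z_i$, and taking the sum for all the particles of
the system, we obtain the relation

$$\sum_{i=1}^{n}\left[(\mu_i\ddot{x}_i - X_i)\,\delta x_i + (\mu_i\ddot{y}_i - Y_i)\,\delta y_i + (\mu_i\ddot{z}_i - Z_i)\,\delta z_i\right] = 0. \tag{9}$$

2 The symbol $\delta$ is used to indicate an arbitrary infinitesimal variation.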
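This relation is the analytical statement of what has been designated
d'Alembert's Principle. This was first published in 1743 and has been
regarded as the basic equation of dynamics. In fact, "Lagrange made
this principle the basis of the entire subject of dynamics." 3

In equation (9) let us introduce the substitutions $\delta x_i = \dot{x}_i\,dt$;
$\delta y_i = \dot{y}_i\,dt$; $\delta z_i = \dot{z}_i\,dt$. This yields the relation

$$\sum_i \mu_i\left(\ddot{x}_i\dot{x}_i + \ddot{y}_i\dot{y}_i + \ddot{z}_i\dot{z}_i\right)dt - \sum_i\left(X_i\dot{x}_i + Y_i\dot{y}_i + Z_i\dot{z}_i\right)dt = 0. \tag{10}$$

But

$$\sum_i \mu_i\left(\ddot{x}_i\dot{x}_i + \ddot{y}_i\dot{y}_i + \ddot{z}_i\dot{z}_i\right) = \frac{dT}{dt}$$

and

$$\sum_i\left(X_i\dot{x}_i + Y_i\dot{y}_i + Z_i\dot{z}_i\right)dt = \sum_i\left(X_i\,dx_i + Y_i\,dy_i + Z_i\,dz_i\right) = -dV.$$

Hence equation (10) becomes

$$\frac{dT}{dt}\,dt + dV = 0. \tag{11}$$

Integrating between the limits $t = 0$ and $t = t$, this last relation
becomes

$$T_t - T_0 + V_t - V_0 = 0,$$

that is,

$$T + V = E, \tag{12}$$

where $E$ is a constant which defines the total energy of the system.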

As an application of equation (10) or (12), let us consider the motion
of a single particle along the $x$-axis under the action of a force $F$ which
is a function of $x$. Hence, we can determine $V$ as a function of $x$.
Given the total energy $E$, we obtain the relations

$$\frac{1}{2}\mu\left(\frac{dx}{dt}\right)^2 = E - V, \qquad \frac{dx}{dt} = \sqrt{\frac{2}{\mu}(E - V)}, \tag{13}$$

and hence

$$t = \int\frac{dx}{\sqrt{\dfrac{2}{\mu}(E - V)}}. \tag{14}$$

3 A. G. Webster, "Dynamics of a Particle."
For instance, for a body moving under gravity, 4 $V = \mu g x$, where $g$
is the acceleration due to gravity. Therefore equation (14) becomes

$$t = \int\frac{dx}{\sqrt{2E/\mu - 2gx}}. \tag{i}$$

Let $z = 2E/\mu - 2gx$.
Then $dz = -2g\,dx$, and

$$t = -\frac{1}{2g}\int\frac{dz}{\sqrt{z}} = -\frac{1}{g}\,z^{1/2}. \tag{ii}$$

For $x = E/(\mu g)$, $t = 0$, which shows that this is the highest point
from which the body begins to fall. Let $E/(\mu g) = x_0$. Then we can
write (ii) in the form

$$t = -\sqrt{\frac{2}{g}}\,(x_0 - x)^{1/2},$$

that is,

$$x_0 - x = \frac{1}{2}gt^2. \tag{iii}$$
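
The quadrature in equation (14) can be carried out numerically for the gravitational case and compared with the closed form $x_0 - x = \tfrac{1}{2}gt^2$. This is a minimal sketch with hypothetical function names and illustrative numbers (CGS units, $g = 981$ cm/sec$^2$); the range of integration stops short of the highest point $x_0$, where the integrand becomes infinite.

```python
import math


def travel_time_numeric(E, mu, g, x_a, x_b, n=100000):
    """t = integral from x_a to x_b of dx / sqrt(2E/mu - 2 g x), by the midpoint rule."""
    dx = (x_b - x_a) / n
    return sum(dx / math.sqrt(2.0 * E / mu - 2.0 * g * (x_a + (i + 0.5) * dx))
               for i in range(n))


def time_from_top(x0, x, g):
    """Magnitude of t from the relation x0 - x = g t^2 / 2."""
    return math.sqrt(2.0 * (x0 - x) / g)


if __name__ == "__main__":
    mu, g, x0 = 2.0, 981.0, 100.0      # illustrative values
    E = mu * g * x0                    # total energy when the highest point is x0
    numeric = travel_time_numeric(E, mu, g, 0.0, 90.0)
    closed = time_from_top(x0, 0.0, g) - time_from_top(x0, 90.0, g)
    print(numeric, closed)
```

The time required to pass between the heights 0 and 90 cm, obtained by direct quadrature, should agree with the difference of the two closed-form times.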



In this connection it is essential to make some remarks regarding the 
signs of E and V in atomic dynamics. Obviously the kinetic energy 
T is always positive. With regard to the sign of V the case is different. 
Since measurements yield only the difference in potential energy with 
respect to some point in space, which is assumed to be the zero of 
potential, it is necessary in dealing with atomic systems to define 
arbitrarily the state for which $V = 0$.

Now let us consider an electron (charge = $-e$) at an infinite distance
from a positive charge of magnitude $Ne$. As the electron approaches
the latter the force of attraction increases in accordance with Coulomb's 
law, and the kinetic energy T increases. Since E is constant, V must 
decrease as r, the distance of separation, decreases. 

If now we assume that for $r = \infty$, $V = 0$, then $V = V(r)$ must
become more and more negative as $r$ decreases. That is,

$$V = -\int_r^\infty \frac{Ne^2}{r^2}\,dr = -\frac{Ne^2}{r}.$$

This shows that $V$ will always be negative if the negative sign is used in
the relation

$$F = -\frac{Ne^2}{r^2}.$$

4 Slater and Frank, "Introduction to Theoretical Physics," p. 42.
That is, for attractive forces the negative sign must be used, and evidently
a positive sign will indicate repulsion.

Since V is negative the sign of E depends upon the absolute magnitudes 
of T and V. But in atomic systems, consisting of a positive nucleus 
and one or more electrons, energy must be absorbed by the system in 
order to remove an electron, that is, work must be done on the system. 
Hence, E as well as V must be negative. The same conclusion obviously 
applies to any other system in which attractive forces exist between the 
constituents. On the other hand, in the case of repulsive forces be- 
tween the particles of a system E must be positive. The importance of 
these considerations will become evident from illustrations which will 
be given both in this and succeeding chapters. 

4.3 Generalized Coordinates. Instead of specifying the position of a
particle by rectangular coordinates, it is more convenient in many
problems to use some other system of coordinates. For instance, the
position of the point $P$ may be specified, as shown in Fig. 20, by the
distance $r$ and the angles $\alpha$, $\beta$, $\gamma$ which $OP$ makes with the three
rectangular axes. Since

$$\begin{aligned}
x &= r\cos\alpha,\\
y &= r\cos\beta,\\
z &= r\cos\gamma,
\end{aligned} \tag{16}$$

it follows that

$$\cos^2\alpha + \cos^2\beta + \cos^2\gamma = 1. \tag{17}$$

Therefore the three angles are not independent, and the position of $P$
can be specified by $r$ and any two of the three angles.





FIG. 20. Illustrating equation (16).

FIG. 21. Illustrating cylindrical coordinates.

Other types of coordinate systems are often used in dynamical problems
because of greater convenience. In cylindrical coordinates the
position of a particle is specified by the coordinates $z$, $r$, and $\eta$, as shown
in Fig. 21. Evidently

$$x = r\cos\eta \tag{18a}$$

$$y = r\sin\eta \tag{18b}$$

$$z = z. \tag{18c}$$
The most convenient system for problems of central orbits, such as
those dealing with the motion of an electron about a nucleus, is that of
spherical coordinates. These are indicated in Fig. 22a, where $r = OP$
designates the radius vector, $\theta$ corresponds to the "latitude," and $\eta$ to
the "longitude." The connection between the coordinates $r$, $\theta$, $\eta$ and
the rectangular coordinates is given by the following relations:

$$x = ON = OM\cos\eta = OP\sin\theta\cos\eta = r\sin\theta\cos\eta \tag{19a}$$

$$y = MN = r\sin\theta\sin\eta \tag{19b}$$

$$z = PM = r\cos\theta, \tag{19c}$$

where $PM$ is the perpendicular from the point $P$ to the plane $OXY$
(the equatorial plane).




FIG. 22a. Illustrating relation between rectangular and spherical coordinates. 

For the motion of an electron in the field due to two nuclei separated 
by a fixed distance, confocal elliptic coordinates are used, and in some 
problems still other systems of coordinates may be convenient. 

Corresponding to each of the coordinate variables used to specify
the position of a particle there will be a velocity component. With
rectangular coordinates these components are $\dot{x}$, $\dot{y}$, and $\dot{z}$, and the
velocity in the direction $OP$ (Fig. 20) is determined from the relation

$$v^2 = \dot{x}^2 + \dot{y}^2 + \dot{z}^2. \tag{20}$$
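For the case of spherical coordinates it follows from equations (19)
that

$$\begin{aligned}
\dot{x} &= r\cos\theta\cos\eta\,\dot{\theta} - r\sin\theta\sin\eta\,\dot{\eta} + \dot{r}\sin\theta\cos\eta,\\
\dot{y} &= r\cos\theta\sin\eta\,\dot{\theta} + r\sin\theta\cos\eta\,\dot{\eta} + \dot{r}\sin\theta\sin\eta,\\
\dot{z} &= -r\sin\theta\,\dot{\theta} + \dot{r}\cos\theta.
\end{aligned}$$

Hence

$$v^2 = \dot{x}^2 + \dot{y}^2 + \dot{z}^2 = \dot{r}^2 + r^2\dot{\theta}^2 + r^2\sin^2\theta\,\dot{\eta}^2. \tag{21}$$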
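
The reader who wishes to check equation (21) without carrying through the algebra may do so numerically. In the sketch below (function names and the test path are hypothetical), the rectangular velocity components are estimated by central differences along an arbitrary path given in spherical coordinates, and the sum of their squares is compared with the right-hand side of (21).

```python
import math


def to_rectangular(r, theta, eta):
    """Equations (19a) to (19c)."""
    return (r * math.sin(theta) * math.cos(eta),
            r * math.sin(theta) * math.sin(eta),
            r * math.cos(theta))


def speed_squared_spherical(r, theta, r_dot, theta_dot, eta_dot):
    """Right-hand side of equation (21)."""
    return r_dot ** 2 + (r * theta_dot) ** 2 + (r * math.sin(theta) * eta_dot) ** 2


def speed_squared_numeric(path, t, h=1e-6):
    """Central-difference estimate of x_dot^2 + y_dot^2 + z_dot^2 for a
    path t -> (r, theta, eta)."""
    plus = to_rectangular(*path(t + h))
    minus = to_rectangular(*path(t - h))
    return sum(((p - m) / (2.0 * h)) ** 2 for p, m in zip(plus, minus))


if __name__ == "__main__":
    # Hypothetical path with constant rates r_dot = 0.3, theta_dot = 0.5, eta_dot = 1.1.
    path = lambda t: (1.0 + 0.3 * t, 0.7 + 0.5 * t, 0.2 + 1.1 * t)
    r, theta, _ = path(0.4)
    print(speed_squared_numeric(path, 0.4),
          speed_squared_spherical(r, theta, 0.3, 0.5, 1.1))
```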

This result may also be deduced readily from an inspection of Figs.
22b and 22c. 5

FIG. 22b. Projection on meridian plane ($OM = r\sin\theta$).

FIG. 22c. Projection on equatorial plane.

Cylindrical and spherical coordinates are illustrations of a general
class known as orthogonal curvilinear systems of coordinates which have
certain common properties. Let these generalized coordinates be
designated by $q_1$, $q_2$, and $q_3$. Then it will be found that the three sets
of coordinate surfaces $q_1$ = constant, $q_2$ = constant, and $q_3$ = constant
intersect at right angles. For each of these systems, the element of
distance $ds$ is given by a relation of the form

$$ds^2 = a_1\,dq_1^2 + a_2\,dq_2^2 + a_3\,dq_3^2, \tag{22}$$



5 Figure 22b shows a projection of the meridian plane containing the right-angled
triangle $OMP$, while Fig. 22c shows a similar projection of the equatorial plane
containing the right-angled triangle $ONM$. It is evident that $r\,\Delta\theta\,\Delta r$ is the area of the
element $PQP'Q'$. From this it follows that $r\,\Delta\theta\,\Delta r \cdot r\sin\theta\,\Delta\eta$ is the element
of volume. In rectangular coordinates, the element of volume is $\Delta x\,\Delta y\,\Delta z$. Hence,
in the limit,

$$dx\,dy\,dz = r^2\sin\theta\,dr\,d\theta\,d\eta,$$

and it is seen that $r^2\sin\theta$ is the coefficient by which $dr\,d\theta\,d\eta$ must be multiplied to
convert it into an element of volume. See further remarks in Appendix III.
where $a_1$, $a_2$, and $a_3$ are coefficients involving one or more of the three
generalized coordinates. Also the element of volume in terms of any
orthogonal system of coordinate variables is given by

$$d\tau = dx\,dy\,dz = \sqrt{a_1a_2a_3}\;dq_1\,dq_2\,dq_3, \tag{23}$$

and the coefficient $\sqrt{a_1a_2a_3}$ is known as the discriminant for the
transformation from rectangular to orthogonal curvilinear coordinates. 6
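
For spherical coordinates, comparison of equation (21) with the form of the element of distance gives $a_1 = 1$, $a_2 = r^2$, $a_3 = r^2\sin^2\theta$, so that $\sqrt{a_1a_2a_3} = r^2\sin\theta$. As a simple illustration (the function name and the number of subdivisions are arbitrary choices made here), summing this volume element over a sphere of radius $R$ should reproduce the familiar volume $\tfrac{4}{3}\pi R^3$.

```python
import math


def sphere_volume(R, n=200):
    """Sum of r^2 sin(theta) dr dtheta deta over 0 < r < R, 0 < theta < pi,
    0 < eta < 2*pi, using the midpoint rule in r and theta."""
    dr = R / n
    dtheta = math.pi / n
    radial = sum(((i + 0.5) * dr) ** 2 for i in range(n)) * dr
    polar = sum(math.sin((j + 0.5) * dtheta) for j in range(n)) * dtheta
    # The integrand does not depend on eta, which contributes a factor 2*pi.
    return radial * polar * 2.0 * math.pi


if __name__ == "__main__":
    R = 2.0
    print(sphere_volume(R), 4.0 / 3.0 * math.pi * R ** 3)
```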

In general, the positions, at any instant of time, of all the particles
constituting a given system may be specified by $f$ variables, which we
shall designate by the letters $q_1, q_2, \ldots, q_f$. For a system of $n$ particles
the maximum value of $f$ would be $3n$, but usually relations of constraint
will exist between the particles, so that $f$ will be less than $3n$. If these
variables are so chosen that $f$ is a minimum they are designated as
generalized coordinates and $f$ then corresponds to the number of degrees
of freedom of the system. It is customary to designate these generalized
coordinates by the letters $q_1, q_2, \ldots, q_f$, and the corresponding generalized
velocities are therefore $\dot{q}_1, \dot{q}_2, \ldots, \dot{q}_f$.

In a system consisting of $n$ particles, the rectangular coordinates of the
$i$th particle will be expressed in terms of the generalized variables by
functional relations of the form 7

$$\begin{aligned}
x_i &= x_i(q_1, q_2, \ldots, q_f),\\
y_i &= y_i(q_1, q_2, \ldots, q_f),\\
z_i &= z_i(q_1, q_2, \ldots, q_f).
\end{aligned} \tag{24}$$

Hence any arbitrary small variation in $x_i$ will be given by a relation
of the form

$$\delta x_i = \frac{\partial x_i}{\partial q_1}\,\delta q_1 + \frac{\partial x_i}{\partial q_2}\,\delta q_2 + \cdots + \frac{\partial x_i}{\partial q_f}\,\delta q_f, \tag{25}$$

with similar expressions for $\delta y_i$ and $\delta z_i$. The partial differential
coefficient $\partial x_i/\partial q_1$ is used in this relation instead of the more exact
expression $\partial x_i(q_1, q_2, \ldots, q_f)/\partial q_1$, and in equation (25) it will be observed
that while $\delta x_i$ designates a variation in the coordinate variable $x_i$, the
symbol $x_i$ in the partial differential coefficient represents a function of
the $f$ generalized coordinates $q_1, \ldots, q_f$.

6 For the derivation of equation (23) and discussion of these systems of coordinates, 
see Appendix III, and also the following references: Slater and Frank, "Introduction 
to Theoretical Physics," p. 200; Pauling and Wilson, "Introduction to Quantum 
Mechanics," p. 103 and Appendix IV; A. G. Webster, "The Dynamics of Particles 
and of Rigid, Elastic and Fluid Bodies," Chapter VIII. 

7 The following discussion is based on the excellent chapter, "Advanced Dynamics,"
in L. Page's "Introduction to Theoretical Physics."
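It follows that the velocity components are given by relations of the
form

$$\dot{x}_i = \frac{\partial x_i}{\partial q_1}\,\dot{q}_1 + \frac{\partial x_i}{\partial q_2}\,\dot{q}_2 + \cdots + \frac{\partial x_i}{\partial q_f}\,\dot{q}_f, \tag{26}$$

with similar expressions for $\dot{y}_i$ and $\dot{z}_i$.

Equation (26) shows that the expression for $\dot{x}_i$ ($\dot{y}_i$ or $\dot{z}_i$) is a
homogeneous linear function of the generalized velocities, with coefficients
which are functions of the $q$'s. Since

$$T = \sum_{i=1}^{n}\tfrac{1}{2}\mu_i\left(\dot{x}_i^2 + \dot{y}_i^2 + \dot{z}_i^2\right), \tag{27}$$

it follows from (26) that $T$ is a homogeneous quadratic function of the
$\dot{q}$'s.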

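From this fact may be deduced another important result by application
of Euler's theorem for homogeneous functions. This theorem
states that if $u$ is a homogeneous function of the $n$th degree of any
number ($f$) of variables $x_1, x_2, \ldots, x_f$, then

$$x_1\frac{\partial u}{\partial x_1} + x_2\frac{\partial u}{\partial x_2} + \cdots + x_f\frac{\partial u}{\partial x_f} = nu.$$

Since $T$ is a homogeneous function of the second degree in the generalized
velocities, it follows from Euler's theorem that

$$2T = \dot{q}_1\frac{\partial T}{\partial\dot{q}_1} + \dot{q}_2\frac{\partial T}{\partial\dot{q}_2} + \cdots + \dot{q}_f\frac{\partial T}{\partial\dot{q}_f}. \tag{28}$$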
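
Equation (28) may be illustrated for a single particle described in spherical coordinates, for which $T$ is given by equation (21) multiplied by $\mu/2$. In the following sketch the function names and numerical values are hypothetical, and the partial derivatives $\partial T/\partial\dot{q}_k$ are estimated by central differences rather than formed analytically.

```python
import math


def kinetic_energy(mu, q, q_dot):
    """T = (mu/2) * (r_dot^2 + r^2 theta_dot^2 + r^2 sin^2(theta) eta_dot^2)."""
    r, theta, _eta = q
    r_dot, theta_dot, eta_dot = q_dot
    return 0.5 * mu * (r_dot ** 2 + (r * theta_dot) ** 2
                       + (r * math.sin(theta) * eta_dot) ** 2)


def euler_sum(mu, q, q_dot, h=1e-6):
    """Sum over k of q_dot_k * dT/dq_dot_k, with each derivative estimated
    by a central difference in the corresponding generalized velocity."""
    total = 0.0
    for k in range(3):
        up = list(q_dot)
        down = list(q_dot)
        up[k] += h
        down[k] -= h
        dT = (kinetic_energy(mu, q, up) - kinetic_energy(mu, q, down)) / (2.0 * h)
        total += q_dot[k] * dT
    return total


if __name__ == "__main__":
    q, q_dot = (1.3, 0.8, 0.4), (0.2, -0.5, 0.9)     # illustrative values
    print(euler_sum(2.0, q, q_dot), 2.0 * kinetic_energy(2.0, q, q_dot))
```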

4.4 Lagrange's Equations. In section 2 it was shown that Newton's 
second law of motion may be expressed in terms of rectangular co- 
ordinate variables by the set of equations (8). Can the laws of dy- 
namics be obtained in forms which are independent of the particular 
nature of the coordinate system used? This was the question which 
Lagrange, the great French mathematician, answered in his "Mécanique
analytique." As Lindsay and Margenau describe his achievement: 8

Lagrange proceeded from very general considerations, endeavoring to reduce 
mechanics to pure analysis and emancipate it from the connection with geometry 
which had been one of its outstanding characteristics as developed by Newton. 
With triumphant éclat Lagrange announced in the preface to his book that "there
are no figures in this work," implying that all had been reduced to algebraic analysis 
(in the large sense). 

We shall now consider the method used by Lagrange in deriving the 
laws of dynamics in terms of generalized coordinates. 

8 Lindsay and Margenau, "Foundations of Physics," p. 136. 
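In equation (9) let us replace $\delta x_i$ by the expression derived in equation
(25), and similarly for $\delta y_i$ and $\delta z_i$, and for all the $n$ particles. The
resultant expression of d'Alembert's Principle is of the form

$$\sum_{k=1}^{f}\sum_{i=1}^{n}\left[(\mu_i\ddot{x}_i - X_i)\frac{\partial x_i}{\partial q_k} + (\mu_i\ddot{y}_i - Y_i)\frac{\partial y_i}{\partial q_k} + (\mu_i\ddot{z}_i - Z_i)\frac{\partial z_i}{\partial q_k}\right]\delta q_k = 0. \tag{29}$$

Evidently,

$$\ddot{x}_i\frac{\partial x_i}{\partial q_k} = \frac{d}{dt}\left(\dot{x}_i\frac{\partial x_i}{\partial q_k}\right) - \dot{x}_i\frac{d}{dt}\left(\frac{\partial x_i}{\partial q_k}\right), \tag{30}$$

and from equation (26) it is seen that

$$\frac{\partial\dot{x}_i}{\partial\dot{q}_k} = \frac{\partial x_i}{\partial q_k}.$$

Hence,

$$\dot{x}_i\frac{\partial x_i}{\partial q_k} = \dot{x}_i\frac{\partial\dot{x}_i}{\partial\dot{q}_k} = \frac{\partial}{\partial\dot{q}_k}\left(\frac{\dot{x}_i^2}{2}\right). \tag{31}$$

Also

$$\frac{d}{dt}\left(\frac{\partial x_i}{\partial q_k}\right) = \frac{\partial^2 x_i}{\partial q_1\,\partial q_k}\,\dot{q}_1 + \frac{\partial^2 x_i}{\partial q_2\,\partial q_k}\,\dot{q}_2 + \cdots = \frac{\partial}{\partial q_k}\left(\frac{\partial x_i}{\partial q_1}\,\dot{q}_1 + \frac{\partial x_i}{\partial q_2}\,\dot{q}_2 + \cdots\right) = \frac{\partial\dot{x}_i}{\partial q_k}$$

from equation (26).
Therefore,

$$\dot{x}_i\frac{d}{dt}\left(\frac{\partial x_i}{\partial q_k}\right) = \dot{x}_i\frac{\partial\dot{x}_i}{\partial q_k} = \frac{\partial}{\partial q_k}\left(\frac{\dot{x}_i^2}{2}\right). \tag{32}$$

Substituting from equations (31) and (32) in (30) it follows that

$$\ddot{x}_i\frac{\partial x_i}{\partial q_k} = \frac{d}{dt}\left[\frac{\partial}{\partial\dot{q}_k}\left(\frac{\dot{x}_i^2}{2}\right)\right] - \frac{\partial}{\partial q_k}\left(\frac{\dot{x}_i^2}{2}\right).$$

Consequently,

$$\mu_i\ddot{x}_i\frac{\partial x_i}{\partial q_k} = \frac{d}{dt}\left[\frac{\partial}{\partial\dot{q}_k}\left(\tfrac{1}{2}\mu_i\dot{x}_i^2\right)\right] - \frac{\partial}{\partial q_k}\left(\tfrac{1}{2}\mu_i\dot{x}_i^2\right). \tag{33}$$

Now $\tfrac{1}{2}\mu_i\dot{x}_i^2$ is the kinetic energy along the $x$-axis of the $i$th particle.
If we write down relations similar to equation (33) for $\mu_i\ddot{y}_i\,\partial y_i/\partial q_k$ and
$\mu_i\ddot{z}_i\,\partial z_i/\partial q_k$ and sum up the right-hand sides of these equations, it is evident
that the summation of the terms similar to the first term on the right-hand
side of (33) corresponds to

$$\frac{d}{dt}\left(\frac{\partial T}{\partial\dot{q}_k}\right),$$

while the summation of the terms similar to the second term corresponds
to $-\partial T/\partial q_k$. Hence, we derive the result,

$$\sum_{k=1}^{f}\left[\frac{d}{dt}\left(\frac{\partial T}{\partial\dot{q}_k}\right) - \frac{\partial T}{\partial q_k} + \frac{\partial V}{\partial q_k}\right]\delta q_k = 0, \tag{34}$$

since

$$\sum_{i=1}^{n}\left(X_i\frac{\partial x_i}{\partial q_k} + Y_i\frac{\partial y_i}{\partial q_k} + Z_i\frac{\partial z_i}{\partial q_k}\right) = -\frac{\partial V}{\partial q_k}.$$

Since $V$ is not a function of the $\dot{q}$'s, it does not alter the sense of
equation (34) if we write it in the form

$$\sum_{k=1}^{f}\left[\frac{d}{dt}\left(\frac{\partial L}{\partial\dot{q}_k}\right) - \frac{\partial L}{\partial q_k}\right]\delta q_k = 0, \tag{35}$$

where $L = T - V$.

Now $\delta q_1, \delta q_2, \ldots, \delta q_f$ are arbitrary variations. Consequently,
equation (35) cannot be valid unless each of the $f$ expressions in the
brackets on the left-hand side vanishes identically. That is, for each
of the $f$ generalized coordinates there is valid an equation of the form

$$\frac{d}{dt}\left(\frac{\partial L}{\partial\dot{q}_k}\right) - \frac{\partial L}{\partial q_k} = 0. \tag{36}$$

This is known as Lagrange's equation, and the function $L = L(q_k, \dot{q}_k)$
is known as the Lagrangian or kinetic potential function.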
From the method of derivation, it is evident that the Lagrangian 
equations are valid for any coordinate system, as long as the number of 
coordinates corresponds to the total number of degrees of freedom of the 
system of particles. 

For the simplest case, that of the motion of a single particle along the
$x$-axis, equation (36) becomes

$$\frac{d}{dt}\left(\frac{\partial T}{\partial\dot{x}}\right) - \frac{\partial T}{\partial x} - X = 0,$$

where $X = -\partial V/\partial x$.
Since $T = \mu\dot{x}^2/2$, $\partial T/\partial\dot{x} = \mu\dot{x}$ = momentum, and $\partial T/\partial x = 0$,
Lagrange's equation assumes the simple form

$$\mu\ddot{x} - X = 0,$$

which is Newton's second law of motion.
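
A case in which the generalized coordinate is not a rectangular one may make the use of equation (36) clearer. The plane pendulum is not treated in the text and is introduced here only as a hypothetical illustration: with the angle $\varphi$ as the single coordinate, $T = \tfrac{1}{2}\mu l^2\dot{\varphi}^2$ and $V = -\mu g l\cos\varphi$, and Lagrange's equation gives $\ddot{\varphi} = -(g/l)\sin\varphi$. The sketch integrates this equation by the standard fourth-order Runge-Kutta method and verifies that $T + V$ remains constant, as required by equation (12).

```python
import math


def pendulum_energy(mu, length, g, phi, phi_dot):
    """Total energy T + V of the plane pendulum (V measured from the pivot level)."""
    T = 0.5 * mu * (length * phi_dot) ** 2
    V = -mu * g * length * math.cos(phi)
    return T + V


def integrate_pendulum(length, g, phi0, phi_dot0, t_end, n=20000):
    """Runge-Kutta (RK4) integration of phi_ddot = -(g/length) * sin(phi),
    which is Lagrange's equation (36) for L = T - V of the pendulum."""
    def deriv(phi, w):
        return w, -(g / length) * math.sin(phi)

    dt = t_end / n
    phi, w = phi0, phi_dot0
    for _ in range(n):
        k1 = deriv(phi, w)
        k2 = deriv(phi + 0.5 * dt * k1[0], w + 0.5 * dt * k1[1])
        k3 = deriv(phi + 0.5 * dt * k2[0], w + 0.5 * dt * k2[1])
        k4 = deriv(phi + dt * k3[0], w + dt * k3[1])
        phi += dt * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0]) / 6.0
        w += dt * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1]) / 6.0
    return phi, w


if __name__ == "__main__":
    phi, w = integrate_pendulum(1.0, 9.81, 1.0, 0.0, 5.0)
    print(pendulum_energy(1.0, 1.0, 9.81, 1.0, 0.0),
          pendulum_energy(1.0, 1.0, 9.81, phi, w))
```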

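4.5 Motion of Electron in Bohr Atom in Absence of Quantizing
Conditions. 9 By means of the Lagrange equations it is possible to
derive the equations of motion for an electron in the field due to a
nucleus of charge $+Ne$. This is a typical case of central field motion
and is described most conveniently in terms of the spherical coordinates
$r$, $\theta$, and $\eta$. The equations for the transformation from rectangular
coordinates have been given in (19), and the corresponding expression
for the kinetic energy of the electron is given, in accordance with
equation (21), by the relation

$$T = \frac{\mu}{2}\left(\dot{r}^2 + r^2\dot{\theta}^2 + r^2\sin^2\theta\,\dot{\eta}^2\right). \tag{37}$$

The potential energy function, as shown in section 2, is given by the
relation

$$V = -\frac{Ne^2}{r}. \tag{38}$$

Hence, with $L = T - V$,

$$\frac{\partial L}{\partial\dot{r}} = \mu\dot{r}, \qquad \frac{\partial L}{\partial\dot{\theta}} = \mu r^2\dot{\theta}, \tag{39}$$

$$\frac{\partial L}{\partial\dot{\eta}} = \mu r^2\sin^2\theta\,\dot{\eta}, \tag{40}$$

$$\frac{\partial L}{\partial r} = \frac{\partial T}{\partial r} - \frac{\partial V}{\partial r} = \mu r\left(\dot{\theta}^2 + \sin^2\theta\,\dot{\eta}^2\right) - \frac{Ne^2}{r^2}, \tag{41}$$

$$\frac{\partial L}{\partial\theta} = \mu r^2\sin\theta\cos\theta\,\dot{\eta}^2, \tag{42}$$

$$\frac{\partial L}{\partial\eta} = 0. \tag{43}$$

9 The discussion in this section follows that given by Lindsay and Margenau,
"Foundations of Physics," pp. 138-140.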
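Consequently, the Lagrange equations are as follows:

$$\frac{d}{dt}\left(\frac{\partial L}{\partial\dot{r}}\right) - \frac{\partial L}{\partial r} = \mu\ddot{r} - \mu r\left(\dot{\theta}^2 + \sin^2\theta\,\dot{\eta}^2\right) + \frac{Ne^2}{r^2} = 0, \tag{44}$$

$$\frac{d}{dt}\left(\mu r^2\dot{\theta}\right) - \mu r^2\sin\theta\cos\theta\,\dot{\eta}^2 = 0, \tag{45}$$

$$\frac{d}{dt}\left(\mu r^2\sin^2\theta\,\dot{\eta}\right) = 0. \tag{46}$$

In consequence of the last equation,

$$\mu r^2\sin^2\theta\,\dot{\eta} = C, \tag{47}$$

where $C$ is a constant of integration.

Eliminating $\dot{\eta}$ from equations (44) and (45) by means of (47) we
obtain the two equations of motion

$$\mu\ddot{r} - \mu r\dot{\theta}^2 - \frac{C^2}{\mu r^3\sin^2\theta} + \frac{Ne^2}{r^2} = 0, \tag{48}$$

$$\frac{d}{dt}\left(\mu r^2\dot{\theta}\right) - \frac{C^2\cos\theta}{\mu r^2\sin^3\theta} = 0. \tag{49}$$

The fact that the problem is thus reduced to one involving only two
degrees of freedom shows that the motion occurs in a plane and may be
represented in terms of the variables $r$ and $\theta$. Consequently, we can
take the plane as that for which $\eta = 0$, and we have $\dot{\eta} = 0$ and $C = 0$,
so that equations (48) and (49) assume the simpler forms

$$\mu\ddot{r} - \mu r\dot{\theta}^2 + \frac{Ne^2}{r^2} = 0, \tag{50}$$

and

$$\frac{d}{dt}\left(\mu r^2\dot{\theta}\right) = 0. \tag{51}$$

The last equation leads to the very important conclusion

$$\mu r^2\dot{\theta} = \mu r^2\frac{d\theta}{dt} = \alpha, \tag{52}$$

where $\alpha$ is a constant, and equation (52) states that the area described
per unit time by the radius vector, which is equal to $\tfrac{1}{2}r^2\dot{\theta}$, is a constant.
This constitutes one of Kepler's laws for planetary orbits.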
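Substituting this result in equation (50) we obtain the second-order
differential equation for the radial motion of the electron in the form

$$\ddot{r} - \frac{\alpha^2}{\mu^2r^3} + \frac{Ne^2}{\mu r^2} = 0. \tag{53}$$

Multiplying through by $2\,dr/dt$, we obtain the equation

$$\frac{d}{dt}\left(\frac{dr}{dt}\right)^2 = -\frac{2Ne^2}{\mu r^2}\frac{dr}{dt} + \frac{2\alpha^2}{\mu^2r^3}\frac{dr}{dt},$$

which, on integration, gives the equation

$$\left(\frac{dr}{dt}\right)^2 = \frac{2Ne^2}{\mu r} - \frac{\alpha^2}{\mu^2r^2} + \beta,$$

where $\beta$ is an integration constant.
Hence,

$$\frac{dr}{dt} = \sqrt{\beta + \frac{2Ne^2}{\mu r} - \frac{\alpha^2}{\mu^2r^2}}. \tag{54}$$

Combining this with equation (52), and thus eliminating $dt$, we obtain
the differential equation for the orbit in the form

$$d\theta = \frac{\alpha\,dr}{\mu r^2\sqrt{\beta + \dfrac{2Ne^2}{\mu r} - \dfrac{\alpha^2}{\mu^2r^2}}}. \tag{55}$$

Introducing the variable

$$u = \frac{\alpha}{\mu r} - \frac{Ne^2}{\alpha},$$

it is readily shown that the integral of equation (55) is

$$\frac{\alpha}{\mu r} = \frac{Ne^2}{\alpha} + \sqrt{\beta + \frac{N^2e^4}{\alpha^2}}\,\cos(\theta + \gamma), \tag{56}$$

where $\gamma$ is an integration constant which may be equated to zero, since
it involves only the location of the polar axis with respect to the axis
of reference.

Comparing equation (56) with the polar equation 10 for a conic section
in the form

$$\frac{l}{r} = 1 + \epsilon\cos\theta, \tag{57}$$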

10 See supplementary note 1 for derivation of equation of conic section in terms of 
polar coordinates. 
where $l$ is the semi-parameter, and $\epsilon$ the eccentricity, it is seen that
equation (56) represents a similar curve for which

$$l = \frac{\alpha^2}{\mu Ne^2}, \tag{58}$$

and

$$\epsilon^2 = 1 + \frac{\beta\alpha^2}{N^2e^4}. \tag{59}$$

For $\epsilon < 1$, the curve represents an ellipse; for $\epsilon = 0$, a circle; for
$\epsilon > 1$, the curve represents a hyperbola; while for $\epsilon = 1$, it corresponds
to a parabola. It is therefore necessary to determine the physical
significance of the integration constant $\beta$.

Now

$$E = T + V = \frac{\mu}{2}\left(\dot{r}^2 + r^2\dot{\theta}^2\right) - \frac{Ne^2}{r}.$$

Substituting from equations (52) and (54) it follows that

$$E = \frac{\mu}{2}\left(\beta + \frac{2Ne^2}{\mu r} - \frac{\alpha^2}{\mu^2r^2}\right) + \frac{\alpha^2}{2\mu r^2} - \frac{Ne^2}{r}.$$

That is,

$$E = \frac{\mu\beta}{2}. \tag{60}$$

Since $E$ is negative for a central orbit, 11 it follows that $\beta$ is negative,
and therefore $\epsilon$, as defined by equation (59), is less than 1.
That is, the orbit is an ellipse or circle.

Let $W = -E$, so that $W$ is a positive magnitude. Then it follows
from equations (58), (59), and (60) that

$$1 - \epsilon^2 = -\frac{\beta\alpha^2}{N^2e^4} = \frac{2W\alpha^2}{\mu N^2e^4} = \frac{2Wl}{Ne^2}.$$

But $2l/(1 - \epsilon^2) = 2a$, the major axis of the ellipse.
Therefore

$$W = \frac{Ne^2}{2a}. \tag{61}$$

That is, the energy depends only upon the magnitude of the major axis 
of the ellipse and is independent of the eccentricity. 
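
Both the constancy of $\alpha = \mu r^2\dot{\theta}$ and the relation $W = Ne^2/(2a)$ can be tested by following an orbit numerically. The sketch below is an illustration under stated assumptions: the units are arbitrary ($\mu = 1$, $Ne^2 = 1$), the motion is integrated in rectangular coordinates in the plane of the orbit by the fourth-order Runge-Kutta method, and the major axis is taken as the sum of the least and greatest distances found along the path. The function name and starting values are hypothetical.

```python
import math


def simulate_orbit(mu, Ne2, x, y, vx, vy, t_end, n=20000):
    """RK4 integration of mu * (acceleration) = -Ne2 * (position) / r^3 in the
    orbital plane. Returns (r_min, r_max, alpha_start, alpha_end), where
    alpha = mu * (x*vy - y*vx) = mu * r^2 * theta_dot."""
    def deriv(s):
        px, py, ux, uy = s
        r3 = (px * px + py * py) ** 1.5
        return (ux, uy, -Ne2 * px / (mu * r3), -Ne2 * py / (mu * r3))

    def shifted(s, k, c):
        return tuple(si + c * ki for si, ki in zip(s, k))

    dt = t_end / n
    s = (x, y, vx, vy)
    alpha_start = mu * (x * vy - y * vx)
    r_min = r_max = math.hypot(x, y)
    for _ in range(n):
        k1 = deriv(s)
        k2 = deriv(shifted(s, k1, 0.5 * dt))
        k3 = deriv(shifted(s, k2, 0.5 * dt))
        k4 = deriv(shifted(s, k3, dt))
        s = tuple(si + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
                  for si, a, b, c, d in zip(s, k1, k2, k3, k4))
        r = math.hypot(s[0], s[1])
        r_min, r_max = min(r_min, r), max(r_max, r)
    alpha_end = mu * (s[0] * s[3] - s[1] * s[2])
    return r_min, r_max, alpha_start, alpha_end


if __name__ == "__main__":
    mu, Ne2 = 1.0, 1.0
    # Start at distance 1 with a purely transverse velocity of 0.8 (bound orbit).
    r_min, r_max, a0, a1 = simulate_orbit(mu, Ne2, 1.0, 0.0, 0.0, 0.8, 5.0)
    W = -(0.5 * mu * 0.8 ** 2 - Ne2 / 1.0)
    print("alpha:", a0, a1)
    print("semi-major axis:", 0.5 * (r_min + r_max), "Ne^2/(2W):", Ne2 / (2.0 * W))
```

The integration time is chosen to exceed one period so that both the nearest and the farthest points of the orbit are reached.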

11 See supplementary note 2 for discussion of this statement. 
In section 8 it will be shown how Bohr introduced a quantizing
condition and thus derived the conclusion that the value of $2a$, the major
axis, cannot vary continuously (corresponding to a continuous variation
in $E$) but that the electron can revolve only in a series of discrete
orbits, corresponding to a series of discrete values of $E$.

From equation (52) the period of revolution (T) of the electron in its 
orbit may be deduced as follows: 

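The area of an ellipse, for which $2a$ is the major and $2b$ the minor
axis, is given by $\pi ab$. It follows from equation (52) that

$$\frac{\alpha\tau}{2\mu} = \pi ab,$$

that is,

$$\tau = \frac{2\pi\mu ab}{\alpha}. \tag{62}$$

But

$$b = a\sqrt{1 - \epsilon^2},$$

and from equations (59) and (60) it follows that

$$\frac{1 - \epsilon^2}{\alpha^2} = \frac{2W}{\mu N^2 e^4}.$$

Substituting for $b$ and $\alpha$ in equation (62) it follows that

$$\tau = \frac{2\pi a^2}{Ne^2}\sqrt{2\mu W}.$$

Substituting for $W$ from equation (61), we obtain the two relations

$$\tau^2 = \frac{4\pi^2\mu a^3}{Ne^2} \tag{64a}$$

and

$$\tau = \pi N e^2\sqrt{\frac{\mu}{2W^3}}. \tag{64b}$$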



Equation (64a) corresponds to Kepler's third law for planetary orbits
which states that the square of the period of revolution varies as the
cube of the semi-major axis. Equation (64b) gives $\tau$ in terms of the
total energy $E = -W$.
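As a rough numerical illustration (not part of the original text), the two forms of the period can be compared for a hydrogen-like orbit. The sketch below assumes modern CGS values for the electron mass and charge and an illustrative semi-major axis; the function names are hypothetical.

```python
import math

# Assumed CGS (Gaussian) constants; these values are not taken from the text.
MU = 9.1094e-28        # electron mass, g
E_CHARGE = 4.8032e-10  # electron charge, esu
A_AXIS = 0.5292e-8     # illustrative semi-major axis, cm


def binding_energy(a, n_charge=1):
    """Equation (61): W = -E = N e^2 / (2a)."""
    return n_charge * E_CHARGE**2 / (2.0 * a)


def period_from_axis(a, n_charge=1):
    """Equation (64a): tau^2 = 4 pi^2 mu a^3 / (N e^2)."""
    return 2.0 * math.pi * math.sqrt(MU * a**3 / (n_charge * E_CHARGE**2))


def period_from_energy(w, n_charge=1):
    """Equation (64b): tau = pi N e^2 sqrt(mu / (2 W^3))."""
    return math.pi * n_charge * E_CHARGE**2 * math.sqrt(MU / (2.0 * w**3))


tau_a = period_from_axis(A_AXIS)
tau_w = period_from_energy(binding_energy(A_AXIS))
print(f"tau from (64a): {tau_a:.3e} s")
print(f"tau from (64b): {tau_w:.3e} s")
```

Both routes give the same period because (64b) is obtained from (64a) by eliminating $a$ with equation (61); the eccentricity never enters.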

In deducing these relations the assumption has been introduced
implicitly that the nucleus is at rest. Actually both the nucleus and
electron move in elliptic orbits about their common center of gravity.
The motion of the electron is then described in terms of a particle about
a fixed center if the "reduced" mass $\mu_0$, defined by the relation

$$\frac{1}{\mu_0} = \frac{1}{\mu} + \frac{1}{M},$$

is used, and the radius vector $r$ is defined by the relations

$$r_1 = \frac{M}{M + \mu}\,r, \qquad r_2 = \frac{\mu}{M + \mu}\,r,$$

where $r = r_1 + r_2$. In these relations $\mu$ and $M$ designate the mass of
electron and nucleus respectively, while $r_1$ and $r_2$ represent the distances
of electron and nucleus respectively from their common center
of gravity. 12

It is evident that for $M$ very large compared to $\mu$ the distance $r_2$ is
vanishingly small, so that $r \approx r_1$, and $\mu_0 \approx \mu$. 13
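To give a feeling for the size of this correction, the following sketch evaluates the reduced mass and the two distances from the center of gravity for a hydrogen atom. The proton-to-electron mass ratio used here is an assumed modern value, and the helper names are illustrative only.

```python
def reduced_mass(mu, big_m):
    """Reduced mass from 1/mu0 = 1/mu + 1/M."""
    return mu * big_m / (mu + big_m)


def distances_from_centre(r, mu, big_m):
    """Distances r1 (electron) and r2 (nucleus) from the common center
    of gravity, chosen so that r = r1 + r2 and mu*r1 = M*r2."""
    r1 = big_m * r / (mu + big_m)
    r2 = mu * r / (mu + big_m)
    return r1, r2


MASS_RATIO = 1836.15  # assumed M/mu for the proton; not from the text

mu0 = reduced_mass(1.0, MASS_RATIO)          # in units of the electron mass
r1, r2 = distances_from_centre(1.0, 1.0, MASS_RATIO)  # in units of r
print(f"mu0/mu = {mu0:.6f}")
print(f"r1/r = {r1:.6f}, r2/r = {r2:.6f}")
```

For hydrogen the reduced mass differs from the electron mass by only about five parts in ten thousand, which is why the fixed-nucleus treatment is a good first approximation.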

4.6 Canonical Equations. Since each Lagrangian equation is a
differential equation of the second order, it is more convenient in
dynamical problems to replace these $f$ equations (corresponding to $f$
degrees of freedom) by $2f$ differential equations of the first order.

In order to derive these equations we introduce a new and very
important variable $p_k$, the generalized momentum, which is defined by
the differential equation

$$p_k = \frac{\partial L}{\partial\dot{q}_k} = \frac{\partial T}{\partial\dot{q}_k}. \tag{65}$$

Since $V$ is not a function of $\dot{q}_k$ but $L = T - V$ is a function of
$q_1, q_2, \ldots, q_f$, $\dot{q}_1, \dot{q}_2, \ldots, \dot{q}_f$, and in the most general case of $t$ as well,
we indicate this by the relation

$$L = L(q_k, \dot{q}_k, t). \tag{66}$$

In terms of the new variable, Lagrange's equations assume the
simplified form

$$\dot{p}_k = \frac{\partial L}{\partial q_k}. \tag{67}$$

12 For detailed discussion of this case the student should consult Ruark and Urey, 
"Atoms, Molecules, and Quanta," and Pauling and Wilson, "Introduction to 
Quantum Mechanics." 

13 The sign $\approx$ is used to designate approximate, but not exact, equality.
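From equation (66) it follows that we can write the total differential
of $L$ with respect to the three sets of variables in the form

$$dL = \sum_k \frac{\partial L}{\partial q_k}\,dq_k + \sum_k \frac{\partial L}{\partial\dot{q}_k}\,d\dot{q}_k + \frac{\partial L}{\partial t}\,dt.$$

Incorporating equations (65) and (67), the last equation becomes

$$dL = \sum_k \dot{p}_k\,dq_k + \sum_k p_k\,d\dot{q}_k + \frac{\partial L}{\partial t}\,dt. \tag{68}$$

Now let us introduce a function $H$, defined by the relation 14

$$H = \sum_k p_k\dot{q}_k - L. \tag{69}$$

Then it follows from (68) and (69) that

$$dH = \sum_k\left\{p_k\,d\dot{q}_k + \dot{q}_k\,dp_k - \dot{p}_k\,dq_k - p_k\,d\dot{q}_k\right\} - \frac{\partial L}{\partial t}\,dt$$

$$= \sum_k \dot{q}_k\,dp_k - \sum_k \dot{p}_k\,dq_k - \frac{\partial L}{\partial t}\,dt. \tag{70}$$

Since $dH$ is an exact differential, equation (70) indicates that $H$ is a
function of the independent variables $p_k$, $q_k$, and $t$, that is,

$$H = H(p_k, q_k, t).$$

Hence,

$$dH = \sum_k \frac{\partial H}{\partial p_k}\,dp_k + \sum_k \frac{\partial H}{\partial q_k}\,dq_k + \frac{\partial H}{\partial t}\,dt. \tag{71}$$

Comparing coefficients of $dp_k$, $dq_k$, and $dt$ in equations (70) and (71),
we deduce the three first-order partial differential equations

$$\frac{\partial H}{\partial p_k} = \dot{q}_k = \frac{dq_k}{dt}, \tag{72}$$

$$\frac{\partial H}{\partial q_k} = -\dot{p}_k = -\frac{dp_k}{dt}, \tag{73}$$

and

$$\frac{\partial H}{\partial t} = -\frac{\partial L}{\partial t}.$$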

14 This is known as a Legendre transformation. 
These equations express the equations of motion in the so-called
canonical form, and $p_k$ and $q_k$ are designated canonically conjugated
variables. The function $H = H(p_k, q_k)$, in which $t$ does not enter as
an independent variable, is known as the Hamiltonian, and we shall
now proceed to determine the physical significance of this function.

In this case, $\partial H/\partial t = 0$, and we derive from equation (71) the relation

$$\frac{dH}{dt} = \sum_k \frac{\partial H}{\partial p_k}\,\dot{p}_k + \sum_k \frac{\partial H}{\partial q_k}\,\dot{q}_k,$$

which in virtue of the canonical equations (72) and (73) becomes

$$\frac{dH}{dt} = \sum_k \dot{q}_k\dot{p}_k - \sum_k \dot{p}_k\dot{q}_k = 0.$$

Hence, $H(p_k, q_k)$ = constant, is a first integral of the differential
equations of motion (72) and (73).
Furthermore, by the definition of $H$, as given in (69),

$$H = \sum_k p_k\dot{q}_k - T + V. \tag{75}$$

But, as deduced in equation (28),

$$\sum_k p_k\dot{q}_k = \sum_k \dot{q}_k\frac{\partial T}{\partial\dot{q}_k} = 2T. \tag{76}$$

Hence, equation (75) becomes

$$H = 2T - T + V = T + V = E. \tag{77}$$

Thus, for a conservative system $H = H(p_k, q_k)$ is equal to the total
energy.
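The constancy of $H$ along a dynamical path can be checked numerically. The following is a minimal sketch, assuming a one-dimensional oscillator with $H = p^2/2\mu + kq^2/2$ in arbitrary units and a simple leapfrog stepping scheme; neither the example system nor the integrator is taken from the text.

```python
def hamiltonian(p, q, mu=1.0, k=1.0):
    """H = T + V for a linear oscillator (illustrative choice of system)."""
    return p * p / (2.0 * mu) + 0.5 * k * q * q


def integrate(p, q, dt, steps, mu=1.0, k=1.0):
    """Leapfrog integration of the canonical equations
    dq/dt = dH/dp = p/mu and dp/dt = -dH/dq = -k*q."""
    energies = [hamiltonian(p, q, mu, k)]
    for _ in range(steps):
        p -= 0.5 * dt * k * q   # half step in p
        q += dt * p / mu        # full step in q
        p -= 0.5 * dt * k * q   # second half step in p
        energies.append(hamiltonian(p, q, mu, k))
    return p, q, energies


p_end, q_end, energies = integrate(p=0.0, q=1.0, dt=0.001, steps=10000)
drift = max(abs(e - energies[0]) for e in energies) / energies[0]
print(f"q after t = 10: {q_end:.6f}")
print(f"largest relative change in H: {drift:.2e}")
```

With these starting values the exact motion is $q = \cos t$, and the computed $H$ stays at its initial value to within the small discretization error of the scheme.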

It is evident, from the method used in the derivation of the canonical
equations, that if we have two sets of canonically conjugated variables
$q_k$, $p_k$ and $Q_k$, $P_k$, for which $H(p_k, q_k) = H(Q_k, P_k) = E$, then equations
(72) and (73) are equally valid for both sets of variables. This
conclusion leads to a theory of contact transformations which is of great
importance in both classical and quantum mechanics.

4.7 Hamilton's Principle and the Principle of Least Action. From
Lagrange's equations it is possible to deduce two principles which are
of extremely great importance in classical dynamics and have their
counterpart in quantum mechanics.

By the methods of the calculus of variation it is shown that the
Lagrange equations (36) are a necessary condition which must be
satisfied in order that the integral

$$S = \int_{t_0}^{t_1} L\,dt \tag{78}$$

shall be a maximum or minimum (that is, a stationary value or extremum).
For the integral $S$ it can be shown that it is actually a minimum. This
conclusion is usually described as Hamilton's Principle and is stated as
follows.

If we compare a dynamical path (that is, one which proceeds in
accordance with the laws of dynamics) with varied paths which have the
same termini (in the configuration space) and are described in the same
time, then the time integral of the Lagrangian function has a stationary
value for the dynamical path, so that

$$\delta\int_{t_0}^{t_1}(T - V)\,dt = 0, \tag{79}$$

where $\delta$ denotes that the path is to be varied in any arbitrary manner,
subject, however, to the conditions mentioned above.

Since the integral $S$, defined in equation (78), has a definite value
which depends only on the initial and final values of the coordinates
($q_1^0, q_2^0, \ldots, q_f^0$ and $q_1, q_2, \ldots, q_f$) and on the time interval $t_1 - t_0$, it
follows that

$$S = \int_{t_0}^{t_1} L\,dt = S(q_k, q_k^0, t_1, t_0), \tag{80}$$

where $S$ is known as Hamilton's Function.

For conservative systems, the energy is a constant. Now let us
compare two infinitely near paths, for each of which the energy is the
same, and which are described in the same time interval $t_1 - t_0$. Since
each path is a natural one, and we shall assume that both have the same
initial coordinates, it is evident that the final coordinates cannot be
the same for both.

From Hamilton's Principle it follows that for a natural path

$$\delta\int_{t_0}^{t_1} L\,dt + \delta\int_{t_0}^{t_1} E\,dt = 0.$$

But $L = T - V$, and $E = T + V$. Hence this relation may be expressed
in the form

$$\delta A = \delta\int_{t_0}^{t_1} 2T\,dt = 0, \tag{81}$$

where $A$ is a function, known as the Characteristic Function or Action,
which must be an extremum (actually a minimum) for conservative
systems. The relation stated in the last equation is known as the
Principle of Least Action.

This relation can also be stated in another form, which proved convenient
in the classical theory of atomic mechanics.

Since

$$T = \frac{1}{2}\sum_i \mu_i v_i^2,$$

where $v_i$ is the velocity of the $i$th particle of mass $\mu_i$, it follows that

$$A = \int_{t_0}^{t_1}\sum_i \mu_i v_i^2\,dt = \sum_i\int_{s_0}^{s_1}\mu_i v_i\,ds_i, \tag{82}$$

where $ds_i$ denotes an element of the path. This relation shows that the
Action corresponds to the sum of the line integrals of the momenta
taken over the total path from the initial point $s_0$ to the final point $s_1$.
For a single particle

$$T = E - V,$$

and therefore

$$\mu v = \sqrt{2\mu(E - V)}.$$

Hence,

$$A = \int_{s_0}^{s_1}\sqrt{2\mu(E - V)}\,ds. \tag{83}$$

If $V$ is a function of $f$ coordinates, then $ds$ is an element of "path" in
the $f$-dimensional space.

4.8 The Wilson-Sommerfeld Quantum Conditions. Equation (82)
can also be deduced by making use of equation (76). Combining
(76) with (81) it follows that

$$A = \int_{t_0}^{t_1} 2T\,dt = \int_{t_0}^{t_1}\sum_k p_k\dot{q}_k\,dt = \sum_k\int p_k\,dq_k. \tag{84}$$

Since the value of $A$ depends only upon the initial and final values of
the coordinates and the energy $E$, it is an integral invariant. For any
system for which $E$ is given, the value of $A$ taken between any two
points in the $f$-dimensional space (configuration space) must be a
constant and therefore independent of the particular type of coordinate
system.

In particular, if we consider a periodic type of orbit, the value of the
Action, or line integral of momenta, taken over a complete period of
revolution, must be a constant.
Wilson and Sommerfeld therefore postulated that for atomic systems
this integral, known as the Action integral, and designated by $J_k$ (one
for each degree of freedom or coordinate variable) must be equal to
an integral multiple of Planck's constant $h$ (which has the dimensions of action).
That is, the Wilson-Sommerfeld quantum conditions are of the form

$$J_k = \oint p_k\,dq_k = n_k h. \tag{85}$$

The circle around the sign of integration indicates integration over
the orbit for a complete period $\tau$, where $\nu = 1/\tau$ is the frequency of the
particular motion, and $n_k$ is an integer, which may have a different value
for each coordinate variable.

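Thus, for an elliptic orbit of the electron in the hydrogen-like atom,
the quantum conditions are given by

$$J_\theta = \oint p_\theta\,d\theta = n_\theta h \tag{86}$$

$$J_r = \oint p_r\,dr = n_r h. \tag{87}$$

In this case, as discussed in section 5,

$$E = \tfrac{1}{2}\mu\left(\dot{r}^2 + r^2\dot{\theta}^2\right) - \frac{Ne^2}{r}.$$

Therefore,

$$p_\theta = \frac{\partial T}{\partial\dot{\theta}} = \mu r^2\dot{\theta}$$

and $p_r = \mu\dot{r}$.

Hence,

$$E = H(p_\theta, p_r) = \frac{p_r^2}{2\mu} + \frac{p_\theta^2}{2\mu r^2} - \frac{Ne^2}{r}, \tag{88}$$

and applying the canonical equations corresponding to $\theta$,

$$\frac{\partial H}{\partial p_\theta} = \dot{\theta} \tag{i}$$

$$\frac{\partial H}{\partial\theta} = -\dot{p}_\theta = 0, \tag{ii}$$

that is,

$$p_\theta = \text{constant} = \alpha,$$

as deduced in equation (52).
Hence,

$$J_\theta = \oint p_\theta\,d\theta = 2\pi\alpha.$$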
In the case of the coordinate $r$,

$$\frac{\partial H}{\partial p_r} = \dot{r}, \tag{iii}$$

$$\frac{\partial H}{\partial r} = -\dot{p}_r. \tag{iv}$$

The last equation is identical with equation (53), and the solution
has been given in section 5. In order to apply the quantum condition
it is necessary to evaluate the integral

$$J_r = n_r h = 2\int_{r_0}^{r_m} p_r\,dr, \tag{v}$$

where $r_0$ and $r_m$ denote the minimum and maximum values respectively
of the radius vector.
Now from equations (54) and (60) we have the relation

$$p_r = \sqrt{-2\mu W + \frac{2\mu Ne^2}{r} - \frac{\alpha^2}{r^2}} = \frac{\sqrt{2\mu W}}{r}\sqrt{Q}.$$

Let us consider the significance of the function $Q$. It is evident
that

$$Q = -r^2 + \frac{Ne^2}{W}\,r - \frac{\alpha^2}{2\mu W}$$

$$= -r^2 + 2ar - a^2(1 - \epsilon^2)$$

$$= -r^2 + \{a(1 + \epsilon) + a(1 - \epsilon)\}\,r - a^2(1 - \epsilon^2).$$

In deducing these relations use has been made of equations (61) and
(63). Now from the properties of the ellipse it follows that

$$a(1 + \epsilon) = r_m; \qquad a(1 - \epsilon) = r_0.$$

Consequently,

$$Q = (r - r_0)(r_m - r), \tag{89}$$

and it is seen that $Q = 0$ for $r = r_0$ and $r = r_m$.
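The evaluation of

$$J_r = 2\sqrt{2\mu W}\int_{r_0}^{r_m}\frac{\sqrt{(r - r_0)(r_m - r)}}{r}\,dr$$

is obtained most readily by application of the theory of functions of
complex variables, 15 and the result is

$$J_r = 2\pi\sqrt{2\mu W}\left(\frac{r_0 + r_m}{2} - \sqrt{r_0 r_m}\right) = 2\pi\alpha\left(\frac{Ne^2}{\alpha}\sqrt{\frac{\mu}{-2E}} - 1\right).$$

Hence

$$E_n = -\frac{2\pi^2\mu N^2 e^4}{(J_r + J_\theta)^2} = -\frac{2\pi^2\mu N^2 e^4}{(n_r + n_\theta)^2 h^2} = -\frac{RhcN^2}{n^2}, \tag{90}$$

where $R$ = Rydberg constant (in terms of wave numbers)

$$= \frac{2\pi^2\mu e^4}{ch^3},$$

$E_n$ = energy of atomic system in state of total quantum
number $n$,

$n = n_r + n_\theta$ = an integer,
and $c$ = velocity of light.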
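As an illustrative check of equation (90), the sketch below evaluates the Rydberg constant and the first few energy levels. It assumes modern CGS values of the constants (not the values available to the author), and the function names are hypothetical.

```python
import math

# Assumed CGS (Gaussian) constants; not taken from the text.
MU = 9.1094e-28        # electron mass, g
E_CHARGE = 4.8032e-10  # electron charge, esu
H_PLANCK = 6.6261e-27  # Planck's constant, erg s
C_LIGHT = 2.9979e10    # velocity of light, cm/s
ERG_PER_EV = 1.6022e-12


def rydberg_wave_number():
    """R = 2 pi^2 mu e^4 / (c h^3), in cm^-1."""
    return 2.0 * math.pi**2 * MU * E_CHARGE**4 / (C_LIGHT * H_PLANCK**3)


def bohr_energy(n_r, n_theta, n_charge=1):
    """Equation (90): E_n = -2 pi^2 mu N^2 e^4 / ((n_r + n_theta)^2 h^2), in erg."""
    n = n_r + n_theta
    return (-2.0 * math.pi**2 * MU * n_charge**2 * E_CHARGE**4
            / (n**2 * H_PLANCK**2))


rydberg = rydberg_wave_number()
ground_ev = bohr_energy(0, 1) / ERG_PER_EV
print(f"R = {rydberg:.0f} cm^-1")
print(f"E_1 = {ground_ev:.2f} eV")
# The energy depends only on the sum n_r + n_theta.
print(bohr_energy(1, 1) == bohr_energy(0, 2))
```

The last line illustrates the point made in the text: orbits with different $n_r$ and $n_\theta$ but the same sum share one energy value.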

More strictly, the problem of the hydrogen-like atom should have
been treated as a system of three dimensions, that is, by use of spherical
polar coordinates. 16

When the three quantum conditions (involving an additional one for
the angle $\eta$) are applied, it is found that the value of $E$ is the same as that
derived in (90) with the only difference that $n$, the total quantum
number, is the sum of three quantum numbers $n_r$, $n_\theta$, and $n_\eta$. That
is, $E$ is independent of the individual values of these three numbers and
depends only on the value of their sum. Thus, for the system in terms of

15 See A. Sommerfeld, "Atombau und Spektrallinien " (Ed. 1924), pp. 772-779, 
and J. H. Van Vleck, "Quantum Principles and Line Spectra," pp. 193-196. See 
also Peirce's tables, formulae 161, 183, and 187. This problem is also discussed 
by L. Page, " Introduction to Theoretical Physics," pp. 567-570, in terms of the 
Action and Angle variables. 

16 See J. H. Van Vleck, "Quantum Principles," p. 193 et seq., and M. Born's
"Atommechanik," Chapter III, for a comprehensive discussion of this problem.
In classical mechanics the solution is more conveniently obtained by application of 
the Hamilton-Jacobi partial differential equation. 
two variables, we have the possibilities indicated in the following table,
in which $n_\theta$ is replaced by the more customary symbol $k$.

n    n_r   k    Type of orbit   b/a
1    0     1    Circular        1
2    0     2    Circular        1
2    1     1    Elliptic        0.5
3    0     3    Circular        1
3    1     2    Elliptic        0.67
3    2     1    Elliptic        0.33


and similarly for larger values of $n$.
Since, as shown in section 5, the total energy is given by the relation

$$E_n = -\frac{Ne^2}{2a_n},$$

it follows from equation (90) that the semi-major axis of the orbit in the
state of quantum number $n$ is given by

$$a_n = \frac{n^2 h^2}{4\pi^2\mu N e^2} = \frac{n^2 a_1}{N}, \tag{91}$$

where $a_1$ is the radius of the orbit in the normal state of the hydrogen
atom. This is usually designated by the symbol $a_0$, where

$$a_0 = \frac{h^2}{4\pi^2\mu e^2} = 0.5283 \times 10^{-8} \text{ cm}. \tag{92}$$

Making use of the relations deduced in section 5, it is evident that

$$J_\theta = 2\pi\alpha = kh.$$

Hence,

$$1 - \epsilon^2 = \frac{2\alpha^2 W}{\mu N^2 e^4} = \frac{k^2}{n^2} \tag{93a}$$

and

$$\frac{b}{a} = \frac{k}{n}, \tag{93b}$$

where $b$ is the semi-minor axis of the ellipse.
The last relation follows from equation (vii) in supplementary note 1.
The values of $b/a$ given in the above table for the different orbits show
that, with increase in the value of $n_r = n - k$, the orbits become more
elliptical.
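The entries of the table follow mechanically from $n = n_r + k$ and $b/a = k/n$. A short sketch that regenerates them is given below; the function name and the row layout are illustrative choices, not part of the original text.

```python
def orbit_table(n_max):
    """List (n, n_r, k, type of orbit, b/a) for every orbit with n <= n_max,
    using n = n_r + k and b/a = k/n."""
    rows = []
    for n in range(1, n_max + 1):
        for k in range(n, 0, -1):      # k = n, n-1, ..., 1
            n_r = n - k
            kind = "Circular" if n_r == 0 else "Elliptic"
            rows.append((n, n_r, k, kind, round(k / n, 2)))
    return rows


table = orbit_table(3)
for n, n_r, k, kind, ratio in table:
    print(f"{n:<4}{n_r:<5}{k:<4}{kind:<10}{ratio}")
```

For each $n$ the loop produces $n$ rows, which is the $n$-fold degeneracy discussed in the next paragraph.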

In the cases n = 2, n = 3, and so forth, it is thus possible to have two 
or more orbits with the same value of E. Such states are known as 
degenerate, and in the first case we speak of a twofold degeneracy, in the 
second, of threefold degeneracy, and so on. It is only in the presence of 
electrostatic or magnetic fields that this degeneracy is removed. We 
shall meet with a similar situation when discussing the hydrogen-like 
atom in wave mechanics. 

4.9 Mean Values for Central Orbits. In some of the problems involving
central orbits it is of importance to calculate the mean value of a
given magnitude $M$. By definition the mean value is given by the relation

$$\bar{M} = \frac{1}{\tau}\int_0^\tau M\,dt, \tag{94}$$

where $\tau$ is the period of revolution, and a bar across the symbol designates
the mean value.

For the purpose of comparison with the results deduced by application 
of the S. equation it is of importance, in connection with the problem of 
the hydrogen-like atom, to calculate mean values of r and 1/r for the 
electronic orbits in the Bohr atom. 17 

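From equation (94) we have the relations

$$\bar{r} = \frac{1}{\tau}\int_0^\tau r\,dt \tag{95}$$

and

$$\overline{\left(\frac{1}{r}\right)} = \frac{1}{\tau}\int_0^\tau \frac{dt}{r}. \tag{96}$$

Now if in equation (54) we introduce the values of $\beta$ and $\alpha^2$ given by
equations (60) and (58) respectively, equation (95) assumes the form

$$\bar{r} = \frac{2}{\tau}\int_{r_0}^{r_m}\frac{\mu r\,dr}{\sqrt{-2\mu W + \dfrac{2\mu Ne^2}{r} - \dfrac{\mu Ne^2 l}{r^2}}}, \tag{97a}$$

where $r_0$ and $r_m$ are the minimum and maximum values of $r$, and
$W = -E$. Proceeding as in the deduction of equation (89), we obtain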

17 For more comprehensive discussion see M. Born, " Atommechanik," p. 163 et seq. 
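$$\bar{r} = \frac{2}{\tau}\sqrt{\frac{\mu}{2W}}\int_{r_0}^{r_m}\frac{r^2\,dr}{\sqrt{(r - r_0)(r_m - r)}}. \tag{97b}$$

Let us now introduce the relation for $r$ in terms of the eccentric
anomaly $u$ (which is derived in supplementary note 1), that is,

$$r = a(1 - \epsilon\cos u).$$

Then $r_m = a(1 + \epsilon)$,

and $r_0 = a(1 - \epsilon)$.

Therefore equation (97b) becomes

$$\bar{r} = \frac{2}{\tau}\sqrt{\frac{\mu}{2W}}\int_0^\pi\frac{a^2(1 - \epsilon\cos u)^2\,a\epsilon\sin u\,du}{a\epsilon\sin u} = \frac{2\pi a^2}{\tau}\sqrt{\frac{\mu}{2W}}\left(1 + \frac{\epsilon^2}{2}\right).$$

Introducing the value of $\tau$ deduced in equation (64a), it follows that

$$\bar{r} = a\left(1 + \frac{\epsilon^2}{2}\right),$$

and substituting for $a$ from equation (91), and for $\epsilon^2$ from equation
(93a), it follows that

$$\bar{r} = \frac{a_0}{2N}\left(3n^2 - k^2\right). \tag{98}$$

We shall now consider equation (96) which defines the mean value
of $1/r$. Proceeding as in the previous case, we obtain the relation

$$\overline{\left(\frac{1}{r}\right)} = \frac{2}{\tau}\sqrt{\frac{\mu}{2W}}\int_{r_0}^{r_m}\frac{dr}{\sqrt{(r - r_0)(r_m - r)}} = \frac{2}{\tau}\sqrt{\frac{\mu}{2W}}\int_0^\pi\frac{a\epsilon\sin u\,du}{a\epsilon\sin u} = \frac{2\pi}{\tau}\sqrt{\frac{\mu}{2W}},$$

that is,

$$\overline{\left(\frac{1}{r}\right)} = \frac{1}{a}. \tag{99}$$

This conclusion is obvious from the fact that the mean potential
energy is given by

$$\bar{V} = 2E = -\frac{Ne^2}{a},$$

and is therefore independent of $\epsilon$.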
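These two mean values are easy to confirm numerically. The sketch below is an illustration under one stated assumption: with $r = a(1 - \epsilon\cos u)$, the time element is proportional to $(1 - \epsilon\cos u)\,du$, which follows from substituting the eccentric anomaly into $dt = \sqrt{\mu/2W}\;r\,dr/\sqrt{(r - r_0)(r_m - r)}$. The function name and the sample values of $a$ and $\epsilon$ are hypothetical.

```python
import math


def time_averages(a, eps, samples=20000):
    """Time averages of r and 1/r over one revolution.

    The orbit is sampled uniformly in the eccentric anomaly u, and each
    sample is weighted by (1 - eps*cos u), to which dt is proportional.
    """
    weight_sum = 0.0
    r_sum = 0.0
    inv_r_sum = 0.0
    for i in range(samples):
        u = 2.0 * math.pi * (i + 0.5) / samples
        weight = 1.0 - eps * math.cos(u)
        r = a * weight
        weight_sum += weight
        r_sum += weight * r
        inv_r_sum += weight / r
    return r_sum / weight_sum, inv_r_sum / weight_sum


r_mean, inv_r_mean = time_averages(a=2.0, eps=0.6)
print(f"mean r   = {r_mean:.6f}  (a(1 + eps^2/2) = {2.0 * (1 + 0.18):.6f})")
print(f"mean 1/r = {inv_r_mean:.6f}  (1/a = {1 / 2.0:.6f})")
```

The mean of $1/r$ comes out as $1/a$ whatever eccentricity is chosen, while the mean of $r$ grows with $\epsilon$.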



SUPPLEMENTARY NOTE 1 
EQUATION OF ELLIPSE IN POLAR COORDINATES 

In Fig. 23, $OA = a$ is the semi-major axis, and $OZ = b$, the semi-minor
axis. The line $QD$, which is perpendicular to $OA$, is the directrix;
$F$ is the focus; and $FG$, which is parallel to $QD$, is known as the
semi-latus rectum (designated by $l$).




FIG. 23. Illustrating derivation of equation for ellipse in polar coordinates. 

Let P be any point on the ellipse, and PQ the distance from QD. 
Let FD = d. 

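By the definition of an ellipse, the eccentricity is given by the relation

$$\epsilon = \frac{PF}{PQ} = \frac{r}{d - r\cos\theta}. \tag{i}$$

That is,

$$\frac{1}{r} = \frac{1}{\epsilon d} + \frac{\cos\theta}{d}. \tag{ii}$$

For $r = l$, $\cos\theta = 0$, so that $l = \epsilon d$. Therefore, equation (ii) becomes

$$\frac{l}{r} = 1 + \epsilon\cos\theta. \tag{iii}$$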
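Let $r_0$ and $r_m$ designate minimum and maximum values of $r$, respectively.
For

$$r = FA = r_0, \quad \cos\theta = 1 \quad\text{and}\quad r_0 = \frac{l}{1 + \epsilon}. \tag{iv}$$

For

$$r = FO + a = r_m, \quad \cos\theta = -1 \quad\text{and}\quad r_m = \frac{l}{1 - \epsilon}. \tag{v}$$

Hence,

$$2a = r_0 + r_m = \frac{2l}{1 - \epsilon^2}. \tag{vi}$$

Again

$$OF = a - \frac{l}{1 + \epsilon} = a - \frac{a(1 - \epsilon^2)}{1 + \epsilon} = a\epsilon.$$

Furthermore,

$$b^2 = (ZF)^2 - a^2\epsilon^2 = \epsilon^2(a\epsilon + d)^2 - a^2\epsilon^2 = a^2(1 - \epsilon^2).$$

That is,

$$\frac{b}{a} = \sqrt{1 - \epsilon^2}. \tag{vii}$$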

a 

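The area of the ellipse is calculated most readily by using the
equation in rectangular coordinates

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1. \tag{viii}$$

$$\text{Area} = 4\int_0^a y\,dx = \frac{4b}{a}\int_0^a\sqrt{a^2 - x^2}\,dx.$$

Let

$$\frac{x}{a} = \sin\phi; \qquad dx = a\cos\phi\,d\phi.$$

$$\text{Area} = 4ab\int_0^{\pi/2}\cos^2\phi\,d\phi = \pi ab. \tag{ix}$$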
In calculating some of the integrals which occur in the investigations
on central orbits it has been found convenient to introduce a new variable
$u$, designated the eccentric anomaly. In Fig. 23 let $P'$ be the point at
which the perpendicular through $P$ on the major axis meets the circle
with radius $a$ (the semi-major axis) drawn about $O$ as center. Then
the angle $P'OK = u$.

Evidently,

$$FK = OK - OF = a\cos u - a\epsilon.$$

But from equation (iii)

$$FK = r\cos\theta = \frac{l - r}{\epsilon}.$$

Hence, $l - r = a\epsilon(\cos u - \epsilon)$.

Introducing equation (vi) it follows that

$$r = a(1 - \epsilon\cos u). \tag{x}$$



SUPPLEMENTARY NOTE 2 

RELATION BETWEEN AVERAGE KINETIC AND POTENTIAL ENERGIES 
FOR CENTRAL ORBITS (VIRIAL THEOREM) 

The following proof is given by J. H. Van Vleck. 18 Equations
(4.3) and (4.6) lead to the relations of the form

$$m_i\ddot{x}_i = -\frac{\partial V}{\partial x_i}.$$

Multiplying both sides by $-x_i$, proceeding similarly with the equations
corresponding to $y_i$ and $z_i$, and summing over all the $n$ particles
of the system, we have

$$-\sum_i m_i\left(x_i\ddot{x}_i + y_i\ddot{y}_i + z_i\ddot{z}_i\right) = \sum_i\left(x_i\frac{\partial V}{\partial x_i} + y_i\frac{\partial V}{\partial y_i} + z_i\frac{\partial V}{\partial z_i}\right). \tag{i}$$

Now

$$\frac{d}{dt}\left(x_i\dot{x}_i\right) = x_i\ddot{x}_i + \dot{x}_i^2 \tag{ii}$$

and so forth, and furthermore $d(x_i\dot{x}_i)/dt$ is equal to zero on the average over a very
long time interval if the separating distances and velocities of the particles never
become infinite, and if the center of gravity is assumed at rest. Therefore, the
average of the left-hand side of (i) equals that of the expression $\sum m(\dot{x}^2 + \dot{y}^2 + \dot{z}^2)$
which is twice the mean kinetic energy $\bar{T}$. [See equation (4.27).] Let the potential
$V$ be a homogeneous function of degree $n + 1$ in the coordinates. This will be
the case if the forces of mutual action between particles vary as the $n$th power of
the distance separating them. Then the expression on the right-hand side of (i)
is by Euler's theorem on homogeneous functions [see equation (4.28)] equal to
$(n + 1)V$. Consequently the desired relation is

$$2\bar{T} = (n + 1)\bar{V}. \tag{iii}$$

For motion under the inverse square law, $n = -2$, and consequently

$$2\bar{T} = -\bar{V}. \tag{iv}$$

It follows that for central orbits of this type

$$E = \bar{T} + \bar{V} = \frac{\bar{V}}{2}. \tag{v}$$

Since $V$ is negative for all values of $r < \infty$, it follows that $E$ also is
negative.
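Relation (iv) can be illustrated by integrating an inverse-square orbit numerically and averaging $T$ and $V$ over one period. The sketch below is only a check under stated assumptions: dimensionless units (mass, force constant and semi-major axis all equal to 1, so that the period is $2\pi$), a velocity-Verlet stepping scheme, and a hypothetical function name.

```python
import math


def kepler_virial(eps=0.5, steps=20000):
    """Time averages of kinetic and potential energy over one period of an
    inverse-square orbit with mass = 1, force constant = 1 and a = 1."""
    # Start at the point of closest approach, r0 = a(1 - eps).
    x, y = 1.0 - eps, 0.0
    vx, vy = 0.0, math.sqrt((1.0 + eps) / (1.0 - eps))
    dt = 2.0 * math.pi / steps
    t_sum = 0.0
    v_sum = 0.0
    for _ in range(steps):
        r = math.hypot(x, y)
        t_sum += 0.5 * (vx * vx + vy * vy)
        v_sum += -1.0 / r
        # Velocity-Verlet step: half kick, drift, half kick.
        ax, ay = -x / r**3, -y / r**3
        vx += 0.5 * dt * ax
        vy += 0.5 * dt * ay
        x += dt * vx
        y += dt * vy
        r = math.hypot(x, y)
        ax, ay = -x / r**3, -y / r**3
        vx += 0.5 * dt * ax
        vy += 0.5 * dt * ay
    return t_sum / steps, v_sum / steps


t_mean, v_mean = kepler_virial()
print(f"mean T = {t_mean:.5f}, mean V = {v_mean:.5f}")
print(f"2*mean T + mean V = {2 * t_mean + v_mean:.2e}")
```

In these units the total energy is $-1/2$, so the averages should approach $\bar{T} = 1/2$ and $\bar{V} = -1$, in agreement with (iv) and (v).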

18 "Quantum Principles," pp. 20-21. 


COLLATERAL READING 

1. PAULING, L., and E. B. WILSON, JR., "Introduction to Quantum Mechanics." 

Chapter 1 gives a summary of the subject. 

2. PAGE, L., "Introduction to Theoretical Physics," D. Van Nostrand Co., 

New York, Second Edition, 1936. The student should consult Chapter IV, 
"Advanced Dynamics," and Chapter XV, "Origin of Spectra." The 
method used for deducing Lagrange's equations and Hamilton's Principle 
depends upon the application of a simple theorem in the calculus of variation 
which the author derives in the first section of Chapter IV. 

3. Joos, G., "Theoretical Physics," Translated by I. M. Freeman, G. E. Stechert

& Co., New York, 1934. Another invaluable work of reference on a large 
number of topics, including a section on classical dynamics and on wave 
mechanics. The method given in this treatise for deriving the Lagrange 
equations without the application of calculus of variation has been used in 
the preceding chapter.

4. VAN VLECK, J. H., "Principles of Quantum Mechanics," National Research 

Council Bull., 1926. A very comprehensive discussion of the subject of 
classical atomic mechanics. 

5. LINDSAY, R. B., and MARGENAU, H., "Foundations of Physics," Chapter 3. 

The section on classical mechanics, like all the other sections of the book, 
should be consulted for a deeper understanding of the significance and im- 
portance of the fundamental concepts of dynamics. 

6. BYERLY, W. E., "An Introduction to the Use of Generalized Coordinates in 

Mechanics and Physics," Ginn & Co., Boston, 1916. Like all this author's 
mathematical treatises, the topics are presented clearly, although, again, 
from a different point of view from that adopted in the preceding chapter. 

7. TOLMAN, R. C., "Statistical Mechanics with Applications to Physics and 

Chemistry," Chemical Catalog Co., New York, 1927. Chapter II gives a 
concise presentation of classical mechanics. 

8. WILSON, W., "Theoretical Physics," Vol. I, E. P. Dutton & Co., New York, 

1931. Also a useful work of reference on classical mechanics. The student 
will find, in the chapter "Principles of Dynamics," another interesting method 
of deducing the fundamental relations. 

9. BORN, M., "Vorlesungen über Atommechanik," Vol. I, Julius Springer, Berlin,

1925. A treatment of the classical dynamics and its application to atomic 
problems from the point of view of the Bohr theory. 

10. OSGOOD, W. F., "Mechanics," Macmillan Co., New York, 1937. Contains a

very comprehensive treatment of the subject, with no reference to atomic 
mechanics. 

11. WEBSTER, A. G., "The Dynamics of Particles and of Rigid, Elastic and Fluid 

Bodies," G. E. Stechert, New York. This classical treatise is an extremely 
valuable work of reference. 

12. WHITTAKER, E. T., "A Treatise on the Analytical Dynamics of Particles and 

Rigid Bodies," Macmillan Co., New York. For reference by the advanced 
student. 

13. SLATER, J. C., and FRANK, N. H., "Introduction to Theoretical Physics," 

McGraw-Hill Book Co., New York, 1933. 



CHAPTER V 
THE LINEAR HARMONIC OSCILLATOR 

5.1 Formulation of the S. Equation and Solution. The motion of
a particle of mass $\mu$ acted upon by a restoring force proportional to the
displacement is defined by the differential equation (2.6) which has the
form

$$\mu\frac{d^2x}{dt^2} + kx = 0, \tag{1}$$

where $k$ is a constant. The general solution of this equation, as shown
in Chapter II, section 2, has the form

$$x = \sqrt{A^2 + B^2}\,\sin(\omega t + \delta), \tag{2}$$

where $\omega = \sqrt{k/\mu} = 2\pi\nu_0$, and $\nu_0$ denotes the frequency of vibration,
while $\delta$ is the phase angle.

In order to describe the behavior of such a linear harmonic oscillator
from the point of view of wave mechanics, it is necessary, as a first
step, to express the total energy $E$ in the Hamiltonian form, that is, as
a function $H(p, q)$ of the coordinates and corresponding momenta.
As shown in equation (2.21), this relation has the form

$$E = H(p, q) = \frac{p^2}{2\mu} + \frac{kq^2}{2}, \tag{3}$$

where $p = \mu(dx/dt)$ = momentum, and $q$ is used instead of $x$ to designate
the coordinate of position. Introducing the frequency of vibration
$\nu_0 = \sqrt{k/\mu}/2\pi$, we derive the relation

$$E - U = E - \frac{kq^2}{2} = E - 2\pi^2\nu_0^2\mu q^2. \tag{4}$$

From equation (2.54) it follows that the corresponding S. equation has
the form

$$\frac{d^2\phi}{dq^2} + \frac{8\pi^2\mu}{h^2}\left(E - 2\pi^2\nu_0^2\mu q^2\right)\phi = 0. \tag{5}$$

Let

$$a = \frac{8\pi^2\mu E}{h^2} \tag{6}$$

and

$$b = \frac{4\pi^2\mu\nu_0}{h}. \tag{7}$$

ft 

Hence, we can replace (5) by the equation

$$\frac{d^2\phi}{dq^2} + (a - b^2q^2)\phi = 0. \tag{8}$$

Let us introduce the new variable

$$x = q\sqrt{b}. \tag{9}$$

Since $\mu h\nu_0$ has the same dimensions as $\mu E$ in the expression
$\lambda = h/\sqrt{2\mu E}$, for the de Broglie wave length of a particle having kinetic
energy $E$, it follows that $\sqrt{b}$ has the dimensions of a reciprocal length,
and therefore $x$ must be a dimensionless quantity, that is, a pure number.

Also from (9) it is seen that

$$\frac{d}{dq} = \sqrt{b}\,\frac{d}{dx}.$$

Hence, the S. equation (8) becomes

$$\frac{d^2\phi}{dx^2} + \left(\frac{a}{b} - x^2\right)\phi = 0. \tag{10}$$

Now in order that $\phi$ shall have physical significance, it is necessary
to obtain such solutions of equation (10) as will permit $\phi$ to be finite
and continuous for all values of $x$ ranging from $-\infty$ to $+\infty$. While in
the case of a linear harmonic oscillator, in ordinary mechanics, the
particle may vibrate only between the limits $x = \pm\sqrt{a/b}$ (since at
these limits the value of $U$, the potential energy, becomes equal to the
total energy $E$), there are no such limitations to the range of values of
$x$ in which $\phi$ may be finite. For, according to the Principle of Indeterminacy,
there is no correlation between simultaneous values of position
and velocity; the particle may be anywhere along the coordinate axis $x$.
But it is obvious that $\phi^2$, the probability of occurrence of the particle,
as a function of $x$, must decrease continuously to zero as the values
$x = \pm\infty$ are approached. It is the introduction of these "boundary
conditions" that is essential for the solution of equation (10).

It will be observed that for values of $x \gg \sqrt{a/b}$, equation (10) assumes
the form

$$\frac{d^2\phi}{dx^2} - x^2\phi = 0.$$

Writing this in the "operator" form,

$$(D^2 - x^2)\phi = (D + x)(D - x)\phi = 0,$$

it is seen that this equation is equivalent to the two first-order equations,

$$\frac{d\phi}{\phi} = \pm x\,dx,$$

that is,

$$\ln\phi = \pm\frac{x^2}{2},$$

where ln denotes logarithm to base $e$.

While this solution is only approximate, since the factorization of
the operator $(D^2 - x^2)$ is not justifiable for the case in which $x$ is a
variable, it does indicate that the solution of equation (10) should
involve $e^{-x^2/2}$ as a factor. (Evidently a solution involving $e^{+x^2/2}$
would not be reasonable, since it would correspond to an infinite
value of $\phi(x)$ for $x \to \infty$.)

In consequence of these considerations we shall postulate a solution
for equation (10) of the form

$$\phi(x) = e^{-x^2/2}\,\psi(x), \tag{11}$$

where $\psi(x)$ is a function of $x$, the nature of which must be determined
from further considerations.

From equation (11) it follows that

$$\frac{d\phi}{dx} = e^{-x^2/2}\left(\frac{d\psi}{dx} - x\psi\right),$$

while

$$\frac{d^2\phi}{dx^2} = e^{-x^2/2}\left(\frac{d^2\psi}{dx^2} - 2x\frac{d\psi}{dx} + (x^2 - 1)\psi\right).$$

Since $e^{-x^2/2}$ is not equal to zero (except for $x = \pm\infty$), we obtain,
by substitution of the value for $d^2\phi/dx^2$ indicated by (10) and (11),
the equation to be solved in the form

$$\frac{d^2\psi}{dx^2} - 2x\frac{d\psi}{dx} + \left(\frac{a}{b} - 1\right)\psi = 0. \tag{12}$$

Let us express $\psi(x)$ in the form of a polynomial,

$$\psi(x) = a_nx^n + a_{n-1}x^{n-1} + \ldots + a_1x + a_0. \tag{13}$$

If $\phi(x)$, as defined by equation (11), is to vanish for $x = \pm\infty$, it
follows that $x^ne^{-x^2/2}$ must also approach zero for large values of $x$.
As $x$ becomes infinite, $x^n$, as well as $e^{x^2/2}$, tends to become infinitely
large, so that the value of the product $x^ne^{-x^2/2}$ is indeterminate. If,
however, we make $n$ finite, that is, let the series in (13) have no powers
greater than $x^n$, then it can be shown that $x^ne^{-x^2/2}$ will always decrease
to zero as $x$ approaches $\pm\infty$.

From equation (13) we obtain the relations

$$2x\frac{d\psi}{dx} = 2na_nx^n + 2(n - 1)a_{n-1}x^{n-1} + 2(n - 2)a_{n-2}x^{n-2} + \ldots;$$

$$\frac{d^2\psi}{dx^2} = n(n - 1)a_nx^{n-2} + (n - 1)(n - 2)a_{n-1}x^{n-3} + \ldots.$$

If we substitute these equations in (12) it follows, since this equation
is valid for any value of $x$, that the coefficient of each power of $x$ must
vanish identically. This leads to the "recursion formula"

$$(n + 2)(n + 1)a_{n+2} + \left(\frac{a}{b} - 1 - 2n\right)a_n = 0. \tag{14}$$

The series will end with $x^n$, if the coefficients $a_{n+2}$, $a_{n+4}$, etc., are
each equal to zero, that is, if

$$\frac{a}{b} = 2n + 1, \tag{15}$$

where $n = 0, 1, 2$, etc.
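The way in which condition (15) cuts the series off can be seen by running the recursion formula (14) directly. The following is a minimal sketch; the function name, the index name `nu` (used in place of the running index $n$ of (14) to keep it distinct from the degree of the polynomial) and the starting value 1 for the lowest coefficient are illustrative assumptions.

```python
def series_coefficients(ratio, parity, terms=8):
    """Coefficients generated by the recursion formula (14),
    (nu + 2)(nu + 1) a_{nu+2} + (ratio - 1 - 2 nu) a_nu = 0,
    where ratio stands for a/b.

    parity = 0 starts the even series with a_0 = 1,
    parity = 1 starts the odd series with a_1 = 1.
    """
    coeffs = {parity: 1.0}
    nu = parity
    for _ in range(terms - 1):
        coeffs[nu + 2] = (-(ratio - 1.0 - 2.0 * nu) * coeffs[nu]
                          / ((nu + 2) * (nu + 1)))
        nu += 2
    return coeffs


terminating = series_coefficients(ratio=5.0, parity=0)      # a/b = 2*2 + 1
non_terminating = series_coefficients(ratio=4.0, parity=0)  # not an odd integer
print("a/b = 5:", terminating)
print("a/b = 4:", non_terminating)
```

For $a/b = 5$ the series stops after the $x^2$ term, giving $1 - 2x^2$, whereas for $a/b = 4$ no coefficient ever vanishes and the series does not terminate.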

It is evident that this deduction is analogous to that which was
previously derived for an electron in a "box." The fact that $(a/b)$
may assume only those values corresponding to the series of odd integers
1, 3, 5, etc., indicates that the S. equation (10) can have solutions
which are physically significant only for a series of discrete values of the
energy $E$. These values constitute the characteristic energy values,
corresponding to a series of energy levels of the linear harmonic oscillator,
and the corresponding solutions for $\phi$ constitute a series of characteristic
functions.

Substituting for $(a/b)$ from equations (6) and (7) it is found that the
energy of the oscillator may assume any one of the series of discrete
values defined by the relation

$$E_n = h\nu_0\left(n + \tfrac{1}{2}\right), \tag{16}$$

where $E_n$ is therefore the eigenvalue corresponding to the eigenfunction
$\phi_n$.
On the basis of the older quantum theory, the energy states of the
linear harmonic oscillator were deduced thus: According to equations
(2.24) and (2.25)

$$p = \sqrt{2\mu E}\,\cos\omega t,$$

$$q = \sqrt{\frac{2E}{k}}\,\sin\omega t.$$

Hence the action integral is given by

$$J = \oint p\,dq = 2E\sqrt{\frac{\mu}{k}}\int_0^{2\pi}\cos^2\omega t\;d(\omega t) = \frac{2\pi E}{\omega} = \frac{E}{\nu_0}.$$

But $J = nh$ (quantum condition).

Therefore $E = nh\nu_0$.

On the other hand the S. solution leads to an additional term in the
energy, of value $h\nu_0/2$. From (16) it also follows that the frequency
in the $n$th state is

$$\nu_n = \nu_0\left(n + \tfrac{1}{2}\right).$$

This result is in better agreement with spectroscopic observations
than that derived from the older theory. Furthermore, equation (16)
shows that the energy of a linear oscillator does not vanish at the
absolute zero, but becomes equal to $h\nu_0/2$, thus solving definitely the
problem of the existence of a Nullpunktsenergie.

We shall now proceed with the determination of the corresponding
eigenfunctions. From equations (14) and (15) it follows that

$$n(n - 1)a_n = -[2n - 2(n - 2)]\,a_{n-2} = -2^2a_{n-2},$$

and that

$$(n - 2)(n - 3)a_{n-2} = -[2n - 2(n - 4)]\,a_{n-4} = -2\cdot 2^2a_{n-4},$$

whence we obtain the expression for $\psi_n$

$$\psi_n = a_n\left\{x^n - \frac{n(n - 1)}{2^2\cdot 1!}x^{n-2} + \frac{n(n - 1)(n - 2)(n - 3)}{2^4\cdot 2!}x^{n-4} - \ldots\right\}, \tag{17}$$

where $a_n$ is an arbitrary constant.

Now in investigating various types of functions, Hermite, a noted
mathematician of the nineteenth century, discovered a set of functions
known as "Hermitian" (or Hermite) polynomials, which are defined
by the relation 1

$$H_n(x) = (-1)^ne^{x^2}\frac{d^n\left(e^{-x^2}\right)}{dx^n}, \tag{18}$$

where $n = 0, 1, 2$, etc.

The expression defined by this relation is known as the $n$th Hermitian
polynomial and is identical with (17) if we put $a_n = 2^n$. The first few
members of the series are readily determined.

$$H_0(x) = 1,$$

$$H_1(x) = -e^{x^2}\frac{d\left(e^{-x^2}\right)}{dx} = 2x,$$

$$H_2(x) = (-1)^2e^{x^2}\frac{d^2\left(e^{-x^2}\right)}{dx^2} = 4x^2 - 2,$$

$$H_3(x) = 8x^3 - 12x,$$

$$H_4(x) = 16x^4 - 48x^2 + 12.$$

1 These functions and their properties are described more fully in the mathematical
treatises of Courant-Hilbert and Frank-v. Mises.
From equation (18) the following relations may be derived. 2

$$\frac{dH_n(x)}{dx} = 2nH_{n-1}(x). \tag{19}$$

$$H_{n+1}(x) - 2xH_n(x) + 2nH_{n-1}(x) = 0. \tag{20}$$

The last relation makes it possible to derive the higher members of
the series from the lower members. Combining equations (19) and
(20), it follows that

$$H_{n+1}(x) - 2xH_n(x) + H_n'(x) = 0;$$

and differentiating with respect to $x$,

$$H_{n+1}'(x) - 2H_n(x) - 2xH_n'(x) + H_n''(x) = 0.$$

Hence,

$$2(n + 1)H_n(x) - 2H_n(x) - 2xH_n'(x) + H_n''(x) = 0.$$

That is,

$$H_n''(x) - 2xH_n'(x) + 2nH_n(x) = 0.$$

Evidently the last equation is identical with (12), if we put
$(a/b) - 1 = 2n$.
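The recurrence $H_{n+1}(x) = 2xH_n(x) - 2nH_{n-1}(x)$ lends itself to a short computation. The sketch below builds the polynomial coefficients from it and then checks that each polynomial satisfies $H_n'' - 2xH_n' + 2nH_n = 0$; the function names and the list representation (lowest power first) are illustrative choices.

```python
def hermite_coefficients(n_max):
    """Coefficient lists (lowest power first) of H_0 ... H_{n_max},
    built from H_{n+1}(x) = 2x H_n(x) - 2n H_{n-1}(x)."""
    polys = [[1], [0, 2]]
    for n in range(1, n_max):
        shifted = [0] + [2 * c for c in polys[n]]   # coefficients of 2x H_n
        prev = polys[n - 1] + [0] * (len(shifted) - len(polys[n - 1]))
        polys.append([s - 2 * n * p for s, p in zip(shifted, prev)])
    return polys[: n_max + 1]


def ode_residual(coeffs, n):
    """Coefficients of H'' - 2x H' + 2n H for the polynomial `coeffs`;
    every entry is zero when the differential equation is satisfied."""
    residual = [0] * len(coeffs)
    for power, c in enumerate(coeffs):
        if power >= 2:
            residual[power - 2] += power * (power - 1) * c   # from H''
        residual[power] += (2 * n - 2 * power) * c           # from -2x H' + 2n H
    return residual


polys = hermite_coefficients(6)
for n, coeffs in enumerate(polys):
    print(f"H_{n}: {coeffs}")
```

The fifth list printed corresponds to $H_4(x) = 16x^4 - 48x^2 + 12$, and `ode_residual(polys[n], n)` returns a list of zeros for each $n$.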

Thus the eigenfunctions which satisfy the S. equation (10) are of the
form

$$\phi_n = e^{-x^2/2}\,H_n(x).$$

We now have to consider the physical interpretation of these functions.
As mentioned previously, $\phi_n\bar{\phi}_n\,dx$ or $\phi_n^2\,dx$ (since in the present case $\phi_n$
is a real function) defines the probability of occurrence of the particle in
the element of distance $dx$ at the point $x$. It follows that

$$\int_{x_1}^{x_2}\phi_n^2\,dx$$

is a measure of the probability of occurrence of the particle in the region
$x_2 > x > x_1$, and since the oscillating particle must certainly be located
in the range $x = \pm\infty$, it is necessary that we introduce the
normalizing factor $N_n$, defined by the relation

$$\frac{1}{N_n^2}\int_{-\infty}^{\infty}\phi_n^2\,dx = 1.$$

In order to illustrate the method used in determining the value of 
N n and also for the purpose of demonstrating the orthogonal properties of 

2 Pauling and Wilson, "Introduction to Quantum Mechanics," pp. 77-82 give 
complete details of the derivation. 
the eigenfunctions which satisfy the S. equation (10), it is necessary to
introduce the values of certain definite integrals.
In any standard table of integrals 3 it is shown that

$$\int_0^{\infty} e^{-x^2}x^{2n+1}\,dx = \frac{n!}{2}, \tag{21}$$

$$\int_0^{\infty} e^{-x^2}x^{2n}\,dx = \frac{1\cdot 3\cdot 5\cdots(2n-1)}{2^{n+1}}\sqrt{\pi}. \tag{22}$$

It will be observed that, in the first of these integrals, $x^{2n+1}$ changes
sign with change of sign in x. The function $e^{-x^2}x^{2n+1}$ is therefore
of the type designated as "odd," and resembles the trigonometric
function sin x.

Thus the area under the plot of the function for the range x = 0
to $x = \infty$ is positive, while the area for the range $x = -\infty$ to x = 0
is negative, and consequently the total area for the range $-\infty$ to $+\infty$ is
zero. That is

$$\int_{-\infty}^{\infty} e^{-x^2}x^{2n+1}\,dx = 0. \tag{23}$$

On the other hand, in the case of the second integral, the function
$e^{-x^2}x^{2n}$ is always positive, whether x is positive or negative. The
function is thus of the type designated as "even," and resembles in this
respect the function cos x. Consequently,

$$\int_{-\infty}^{\infty} e^{-x^2}x^{2n}\,dx = 2\int_0^{\infty} e^{-x^2}x^{2n}\,dx = \frac{1\cdot 3\cdot 5\cdots(2n-1)}{2^{n}}\sqrt{\pi}. \tag{24}$$



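By means of this equation and the expressions for $H_n(x)$ we derive
the following values:

$$I_0 = \int_{-\infty}^{\infty} e^{-x^2}\,dx = \sqrt{\pi} = N_0^2,$$

$$I_1 = 4\int_{-\infty}^{\infty} e^{-x^2}x^2\,dx = 2\sqrt{\pi} = N_1^2,$$

$$I_2 = \int_{-\infty}^{\infty} e^{-x^2}(4x^2-2)^2\,dx = 2^2\cdot 2\sqrt{\pi} = N_2^2,$$

where $N_0$, $N_1$, $N_2$ denote the normalizing factors corresponding to
n = 0, 1, and 2, respectively.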

3 For example, L. Silberstein's "Mathematical Tables," G. Bell & Sons, London.
See also Appendix III. 
More generally, if we express $\phi_n$ in the form

$$\phi_n = e^{-x^2/2}\left(a_nx^n + a_{n-2}x^{n-2} + \cdots\right),$$

where $a_n$, $a_{n-2}$, etc., are the coefficients in the power series of the nth
Hermitian polynomial, it is evident that

$$I_n = \int_{-\infty}^{\infty}\phi_n^2(x)\,dx = \sum\int_{-\infty}^{\infty} e^{-x^2}a_ra_s\,x^{r+s}\,dx,$$

where $\sum$ denotes that the sum of the series of terms is to be taken for
the range of values r + s = 2n, 2n - 2, and so forth. Hence, r + s
is always even, and it follows that $I_n$ must always have a definite value

$$I_n = N_n^2.$$

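The value of $N_n$ may be derived as follows. By definition,

$$N_n^2 = \int_{-\infty}^{\infty} e^{-x^2}H_n^2\,dx,$$

where $H_n$ is written instead of $H_n(x)$. From equation (18) it follows
that

$$N_n^2 = (-1)^n\int_{-\infty}^{\infty} H_n\,\frac{d^n(e^{-x^2})}{dx^n}\,dx.$$

Now let us put

$$u = \frac{d^{n-1}(e^{-x^2})}{dx^{n-1}},$$

and consider the result of differentiating the product $H_nu$. Since

$$\frac{d}{dx}(H_nu) = H_nu' + H_n'u,$$

we obtain by integration the relation

$$\Big[H_nu\Big]_{-\infty}^{\infty} = 0 = \int_{-\infty}^{\infty}H_nu'\,dx + \int_{-\infty}^{\infty}H_n'u\,dx.$$

That the first expression on the left vanishes is due to the fact that the
presence of the term $e^{-x^2}$ in all the derivatives of this function makes
u (and consequently $uH_n$) vanish at $x = \pm\infty$.
Combining the last relation with equation (19) it follows that

$$\int_{-\infty}^{\infty}H_nu'\,dx = -2n\int_{-\infty}^{\infty}H_{n-1}u\,dx.$$

That is,

$$\int_{-\infty}^{\infty}H_n\frac{d^n(e^{-x^2})}{dx^n}\,dx = -2n\int_{-\infty}^{\infty}H_{n-1}\frac{d^{n-1}(e^{-x^2})}{dx^{n-1}}\,dx = (-2)^2n(n-1)\int_{-\infty}^{\infty}H_{n-2}\frac{d^{n-2}(e^{-x^2})}{dx^{n-2}}\,dx,$$

and by continuing the same procedure, we finally deduce the relation

$$\int_{-\infty}^{\infty}H_n\frac{d^n(e^{-x^2})}{dx^n}\,dx = (-2)^n\,n!\int_{-\infty}^{\infty}e^{-x^2}\,dx = (-2)^n\,n!\sqrt{\pi}.$$

That is,

$$N_n^2 = 2^n\,n!\sqrt{\pi}. \tag{25}$$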



Figure 24 shows plots of the normalized eigenfunctions, that is,
of $\phi_n/N_n$, for the values n = 0, 1, 2, 3, and 4. It will be observed that,
for n = 0, 2, and 4, the curves are symmetrical with respect to a change
from positive to negative values of x (even functions), while for n = 1
and 3, the curves are antisymmetric with respect to such a change in
sign of x. Excluding the nodes at $x = \pm\infty$, the number of nodes
is always equal to n. This arises from the fact that $H_n(x)$ is a polynomial
of degree n in x, and may therefore be expressed, apart from a constant
factor, in the form $(x - a_1)(x - a_2)\cdots(x - a_n)$. Consequently $H_n(x)$ must
have n roots, that is, n points along the axis of x at which $H_n(x) = 0$.

FIG. 24. Eigenfunctions for lower energy states of linear oscillator.

We shall now consider the integral

$$I_{nm} = \int_{-\infty}^{\infty}\phi_n\phi_m\,dx = \int_{-\infty}^{\infty}e^{-x^2}H_nH_m\,dx,$$

where m is not equal to n.
Assume n = m + p. Then we can proceed to calculate $I_{nm}$ as
in the case m = n, and we shall obtain the result

$$I_{nm} = \int_{-\infty}^{\infty}\phi_{m+p}\,\phi_m\,dx = K\int_{-\infty}^{\infty}\frac{d^p(e^{-x^2})}{dx^p}\,dx,$$

where K is a factor involving $2^p$ and the product n(n - 1) . . . (n - m).
Now from equation (18) it follows that

$$\frac{d^p(e^{-x^2})}{dx^p} = (-1)^p\,e^{-x^2}H_p(x),$$

and it is evident that the differential coefficient $d^p(e^{-x^2})/dx^p$ will consist
of a series of terms of the form $a_rx^re^{-x^2}$ where r has the maximum
value p, and the signs of the coefficients will be alternately plus and
minus. If p is odd, it is evident from equation (23) that the integral
$I_{nm}$ will be equal to zero. If p is even, it is not self-evident that the
same conclusion will be valid, and a more elaborate proof is required.
However, the result may be demonstrated by an actual calculation for
a simple case.

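Thus, let us consider the product $\phi_1\phi_3$. From the values of the
corresponding Hermitian polynomials, it is seen that

$$\int_{-\infty}^{\infty}\phi_1\phi_3\,dx = 8\int_{-\infty}^{\infty}e^{-x^2}(2x^4 - 3x^2)\,dx = \frac{16\cdot 3\sqrt{\pi}}{2^2} - \frac{24\sqrt{\pi}}{2} = 0.$$

Similarly,

$$\int_{-\infty}^{\infty}\phi_2\phi_4\,dx = \int_{-\infty}^{\infty}e^{-x^2}(4x^2 - 2)(16x^4 - 48x^2 + 12)\,dx$$

$$= \int_{-\infty}^{\infty}\left(64e^{-x^2}x^6 - 224e^{-x^2}x^4 + 144e^{-x^2}x^2 - 24e^{-x^2}\right)dx$$

$$= \sqrt{\pi}\left(\frac{64\cdot 1\cdot 3\cdot 5}{2^3} - \frac{224\cdot 1\cdot 3}{2^2} + \frac{144\cdot 1}{2} - 24\right) = 0.$$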

That is, the value of $\phi_n\phi_m$ alternates between positive and negative
values, as may be seen by inspecting the curves shown in Fig. 24, with
the result that $I_{nm} = 0$. This deduction may be stated in the generalized
form that the solutions of the S. equation form an orthogonal set of
eigenfunctions. As mentioned in Chapter III this conclusion is valid for the
solutions obtained for the S. equation in all cases, and may be demonstrated
to be a logical consequence of a very fundamental mathematical
theorem. 4 As shown in subsequent sections, this property of the
solutions of the S. equation is of extremely great importance in dealing
with a number of problems which arise in quantum mechanics.
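
The normalizing factors of equation (25) and the orthogonality of the eigenfunctions can both be checked with exact arithmetic, using only the moment formula (24). The sketch below is an illustration under stated assumptions: the helper names `hermite_coeffs`, `gaussian_moment` and `overlap` are hypothetical, and all integrals are expressed in units of $\sqrt{\pi}$.

```python
from fractions import Fraction


def hermite_coeffs(n):
    """Coefficients of H_n(x), lowest power first, from recurrence (20)."""
    h_prev, h_curr = [1], [0, 2]
    if n == 0:
        return h_prev
    for k in range(1, n):
        shifted = [0] + [2 * c for c in h_curr]
        padded = h_prev + [0] * (len(shifted) - len(h_prev))
        h_prev, h_curr = h_curr, [s - 2 * k * p for s, p in zip(shifted, padded)]
    return h_curr


def gaussian_moment(k):
    """Integral of exp(-x^2) x^k over all x, divided by sqrt(pi); see (23), (24)."""
    if k % 2:
        return Fraction(0)                     # odd integrand, equation (23)
    value = Fraction(1)
    for odd in range(1, k, 2):                 # 1 * 3 * 5 ... (k - 1), each over 2
        value *= Fraction(odd, 2)
    return value


def overlap(n, m):
    """Integral of exp(-x^2) H_n H_m over all x, in units of sqrt(pi)."""
    a, b = hermite_coeffs(n), hermite_coeffs(m)
    return sum(ar * bs * gaussian_moment(r + s)
               for r, ar in enumerate(a) for s, bs in enumerate(b))


print([int(overlap(n, n)) for n in range(5)])  # diagonal terms, N_n^2 / sqrt(pi)
print(overlap(1, 3), overlap(2, 4))            # off-diagonal terms
```

The diagonal values reproduce $2^n n!$, and the two off-diagonal cases worked out above vanish identically.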

5.2 Distribution Functions for Harmonic Oscillator. We shall now
consider further the interpretation, on the basis of wave mechanics,
of the behavior of a particle which, according to classical mechanics,
executes harmonic vibrations with frequency $\nu_0$.

From equation (9) it is observed that in terms of q, the actual
displacement, the dimensionless variable is defined by the relation

$$x = \sqrt{b}\,q.$$

In classical mechanics, the maximum amplitude of oscillation is given
by

$$q_0 = \frac{1}{2\pi\nu_0}\sqrt{\frac{2E}{\mu}},$$

as is evident from equation (4), since, for this value of q, E - U = 0.
Substituting for E from equation (16) and using equation (9), the
corresponding maximum value of x is found to be

$$x_0 = \sqrt{2n+1}. \tag{26}$$

Hence, the motion of the oscillating particle, from the classical
mechanics point of view, would be given by the relation

$$x = \sqrt{2n+1}\,\sin(2\pi\nu_0t), \tag{27}$$

showing that x oscillates between $+\sqrt{2n+1}$ and $-\sqrt{2n+1}$.

From this it is possible to calculate the probability P dx that the
particle will be found between x and x + dx. 5 Since P dx is proportional
to the element of time dt required for the particle to pass from
x to x + dx,

$$P\,dx = A\,dt,$$

where A is a constant which satisfies the condition that

$$\int_{-x_0}^{x_0}P\,dx = 1. \tag{28}$$

4 A proof for this statement is given in the supplementary note 1 for the case in 
which the S. equation involves only one coordinate variable. 

5 The following remarks constitute an amplification of the calculation given by 
Condon and Morse, " Quantum Mechanics, " p. 51. 
From equation (27) it follows that

$$dx = \sqrt{2n+1}\cdot 2\pi\nu_0\cos(2\pi\nu_0t)\,dt = 2\pi\nu_0\sqrt{2n+1-x^2}\,dt.$$

Hence,

$$P\,dx = A\,dt = \frac{A\,dx}{2\pi\nu_0\sqrt{2n+1-x^2}}.$$

Introducing equation (28), according to which

$$\int_{-\sqrt{2n+1}}^{\sqrt{2n+1}}\frac{A\,dx}{2\pi\nu_0\sqrt{2n+1-x^2}} = 1,$$

it is found that

$$A = 2\nu_0,$$

and that therefore

$$P = \frac{1}{\pi\sqrt{2n+1-x^2}}. \tag{29}$$

This result shows that, for the classical oscillator, P increases from
$1/(\pi\sqrt{2n+1})$ at x = 0 to $\infty$ at $x = x_0 = \sqrt{2n+1}$. On the other
hand, S.'s solution leads to finite values of $\phi_n$ even for values of $|x| > |x_0|$.
That is, $\phi^2$ has a definite value even for values of x (or q) which are
forbidden by classical mechanics.
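
The classical distribution (29) can be illustrated by sampling the motion (27) at equally spaced instants over one period and counting how often the particle lies below a chosen position. The Python sketch below is only an illustration: the function names are hypothetical, time is measured in units of the period $1/\nu_0$, and the closed form in `cumulative` is the integral of (29) from $-x_0$ to the chosen limit.

```python
import math


def classical_density(x, n):
    """Equation (29): P(x) for the classical oscillator with x0 = sqrt(2n + 1)."""
    return 1.0 / (math.pi * math.sqrt(2 * n + 1 - x * x))


def cumulative(x_limit, n):
    """Integral of equation (29) from -x0 up to x_limit."""
    return 0.5 + math.asin(x_limit / math.sqrt(2 * n + 1)) / math.pi


def fraction_of_time_below(x_limit, n, samples=200000):
    """Fraction of one period during which x(t) of equation (27) is below x_limit."""
    x0 = math.sqrt(2 * n + 1)
    hits = sum(1 for i in range(samples)
               if x0 * math.sin(2.0 * math.pi * (i + 0.5) / samples) < x_limit)
    return hits / samples


print(classical_density(0.0, 1))                       # value at the center for n = 1
print(fraction_of_time_below(1.0, 1), cumulative(1.0, 1))
```

The two numbers printed on the last line agree closely, which reflects the assumption behind (29) that the probability is proportional to the time spent in each element dx.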

In the region outside the classical limits, the potential energy U
is greater than the total energy E, and hence the de Broglie wave length
is imaginary. From the S. equation (10) it follows that

$$\frac{d^2\phi}{dx^2} = (x^2 - x_0^2)\,\phi.$$

Hence for $x = x_0$, $d^2\phi/dx^2 = 0$, which shows that this constitutes a
point of inflection. For values of $|x| > |x_0|$, $\phi$ decreases continuously
without exhibiting any nodes, and, as mentioned previously, for very
large values of x, $\phi$ varies as $e^{-x^2/2}$, that is, decreases rapidly with
increase in x.

Plotting the values of $\phi^2/N^2$ against x, a series of curves such as those
shown in Fig. 25 is obtained, and designated by A. (The significance
of $1/N^2$ consists in the fact that it makes the area under each of these
curves equal to 1/2. Since similar curves, symmetrical with respect to
the $\phi^2$-axis, may be plotted for negative values of x, the total area under
the curve giving $\phi^2/N^2$ as a function of all values of x is equal to 1.)
The curves designated by B in Fig. 25 give the probability distribution
function as calculated from equation (29), that is, according
to classical mechanics. The difference in results derived from the two
points of view is due, as emphasized in the remarks on the "tunneling
effect," to the complete lack of association in quantum mechanics
between position and velocity. That is, we have here another illustration
of the application of the Principle of Indeterminacy.

FIG. 25. Probability of occurrence of oscillating particle according to
quantum mechanics (Curve A) and according to classical theory (Curve B).

That the probability for the occurrence of the oscillating particle
in the region $|x| > |\sqrt{2n+1}|$ is considerable, is readily shown by
calculating the value of

$$\frac{2}{N_0^2}\int_{x_0}^{\infty}\phi_0^2\,dx$$

in the case of the first characteristic function, $\phi_0 = e^{-x^2/2}$,
corresponding to $E_0 = h\nu_0/2$. Here we find 6 (since $x_0 = 1$),

$$\frac{2}{\sqrt{\pi}}\int_1^{\infty}e^{-x^2}\,dx = 0.1573.$$



That is, there is a 15.7 per cent probability that the particle will occur 
in the region for which the potential energy exceeds the total energy. 
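
The figure of 15.7 per cent can be reproduced in one line, since the integral in question is the complementary error function evaluated at $x_0 = 1$. The sketch below uses `math.erfc` from the Python standard library; the variable name `outside` is an illustrative choice.

```python
import math

# Probability that the n = 0 oscillator is found outside the classical
# limits |x| > x0 = 1:  (2 / sqrt(pi)) * integral from 1 to infinity of exp(-x^2) dx
outside = math.erfc(1.0)
print(round(outside, 4))  # 0.1573
```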

Curves such as those shown in Fig. 25 are known as probability 
distribution curves. As emphasized previously, the Principle of In- 
determinacy states that it is impossible to fix simultaneously the co- 
ordinates of position and the magnitudes of momenta. It follows 
logically that the solution of a S. equation can have only a statistical 
interpretation as regards each of the variables specifying position and 
momentum. The probability distribution curve represents the infor- 
mation about these variables thus derived. 

It will be observed that, in the graph of any such distribution curve,
the ordinate is really a differential coefficient. Thus, in Fig. 25, the
ordinate P should be defined as

$$P = \frac{dp}{dx},$$

where p is the area under the curve between x = 0 and x. In the present
case this area gives the probability for the occurrence of the particle
between 0 and x, where the total probability of occurrence from
$x = -\infty$ to $x = +\infty$ is taken as unity.

6 See supplementary note 2.

In classical physics the best-known curve of a similar nature is that
which represents the Maxwell-Boltzmann distribution law for the
velocities of molecules, which has the form 7

$$\frac{1}{N}\frac{dN}{dc} = 4\pi\left(\frac{m}{2\pi kT}\right)^{3/2}c^2\,e^{-mc^2/(2kT)}.$$

In this equation N designates the total number of molecules, c the
random velocity, m the mass of the molecule, k the Boltzmann constant,
and T the absolute temperature of the gas.

From this equation it is possible to derive values of the average 
velocity, the root-mean-square velocity, and the most probable velocity. 
Similar equations may be derived for the distribution of energy, and 
for distribution of velocities in any given direction. 

Another illustration is the plot representing the distribution of 
energy with frequency in the radiation from a black body at any given 
temperature T. 8 

This relation has the form

$$E_\nu = \frac{dE}{d\nu} = \frac{2\pi\nu^2}{c^2}\cdot\frac{h\nu}{e^{h\nu/kT} - 1},$$

where $\nu$ = frequency, c = velocity of light, and h = Planck's constant.
The total energy radiated is given by

$$E = \int_0^{\infty}E_\nu\,d\nu,$$

that is, by the area under the curve which gives $E_\nu$ as a function of $\nu$.

7 This is discussed very fully in Tolman's "Statistical Mechanics."

8 See S. Dushman, Taylor's "Treatise," Chapter XVI.

5.3 Determination of Average Values. A problem which is of much
interest is the determination of the average values of the coordinates
(designated by q) or of the corresponding momenta (designated by p).
The relations used for calculating these values in quantum mechanics
follow readily from equations of a similar nature which have been derived
in ordinary mechanics for calculating the center of gravity of a
body.

Thus, let us consider a rod of uniform cross section and given length,
but of variable density $\rho$, so that $\rho = \rho(x)$ is a function of x, the
distance from one end. Let $x_0$ denote the coordinate of the center of
gravity. The moment of any element dx about the center of gravity
is given by the product,

mass of element × distance = $\rho\,dx\,(x - x_0)$.

Now the center of gravity is that point about which the total
moment of all the forces acting on each element is zero. Hence, if
l = length of rod,

$$\int_0^l\rho\,(x - x_0)\,dx = 0,$$

or

$$x_0 = \frac{\int_0^l\rho x\,dx}{\int_0^l\rho\,dx}.$$

For example, if $\rho = \rho_0x$, that is, if the linear density is proportional to
the distance x from one end of the rod, then

$$x_0 = \frac{\rho_0\int_0^lx^2\,dx}{\rho_0\int_0^lx\,dx} = \frac{2l}{3}.$$

Similarly, in the case of two coordinates of position, the coordinates
$x_0$, $y_0$ of the center of gravity are given by the relations

$$x_0 = \frac{\iint\rho x\,dx\,dy}{\iint\rho\,dx\,dy}, \qquad y_0 = \frac{\iint\rho y\,dx\,dy}{\iint\rho\,dx\,dy},$$

where $\rho = \rho(x, y)$, and the integration is carried out over the whole
area of the surface under consideration. It is evident that the denominator
always corresponds to the total mass. The extension of
these equations to three dimensions is obvious.
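
The one-dimensional formula can be checked numerically for the rod whose density is proportional to x. The sketch below is a minimal illustration using the midpoint rule; the function name `center_of_gravity`, the rod length of 3 units and the value $\rho_0 = 2$ are arbitrary assumptions made for the example.

```python
def center_of_gravity(density, length, steps=100000):
    """x0 = integral(rho * x dx) / integral(rho dx), evaluated by the midpoint rule."""
    dx = length / steps
    moment = 0.0
    mass = 0.0
    for i in range(steps):
        x = (i + 0.5) * dx          # midpoint of the i-th element
        moment += density(x) * x * dx
        mass += density(x) * dx
    return moment / mass


rod_length = 3.0
x0 = center_of_gravity(lambda x: 2.0 * x, rod_length)   # rho = rho_0 * x with rho_0 = 2
print(x0, 2.0 * rod_length / 3.0)
```

The computed value agrees with 2l/3 and, as expected from the formula, does not depend on the constant $\rho_0$.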

Now, in quantum mechanics, the magnitude $\bar{\phi}\phi$, or $\phi^2$ (in case of
real functions), is a measure of the relative concentration or density at
any point, per unit length, per unit area, or per unit volume (depending
upon whether $\phi$ is a function of one, two, or three variables, respectively).
Consequently, the average values of variables are given by formulas
similar to those derived above, in which $\rho$ is replaced by $\phi^2$ (or $\bar{\phi}\phi$). If
$\phi$ is a function of x only, the average values of x and p, the corresponding
momentum, are given by the relations

$$x_{Av.} = \frac{\int_{-\infty}^{\infty}\bar{\phi}\,x\,\phi\,dx}{\int_{-\infty}^{\infty}\bar{\phi}\phi\,dx}, \tag{30}$$

$$p_{Av.} = \frac{\int_{-\infty}^{\infty}\bar{\phi}\,p\,\phi\,dx}{\int_{-\infty}^{\infty}\bar{\phi}\phi\,dx}. \tag{31}$$



In the case of normalized eigenfunctions, the denominator is equal to
unity, corresponding to the physical interpretation that the total
concentration is to be taken as the standard of reference.

It will be observed that in equation (31) $\bar{\phi}p\phi$ has been written instead
of $\bar{\phi}\phi p$. The reason for this is evident when we remember that in
quantum mechanics p has the significance of the operator,

$$p = \frac{h}{2\pi i}\cdot\frac{d}{dx}.$$

(See Chapter II, section 7.)
Hence,

$$\int\bar{\phi}\,p\,\phi\,dx = \frac{h}{2\pi i}\int\bar{\phi}\,\frac{d\phi}{dx}\,dx,$$

while $\bar{\phi}\phi p$ has no physical significance. That is, p and $\phi$ are
non-commutative symbols. On the other hand, x and $\phi$ are commutative, so that
we could write $\bar{\phi}\phi x$, but it is customary to write the product in the
form given in (30) in order to preserve the analogy with (31).

As an illustration of the application of these equations let us consider
the following problem. Given the state of a system of particles represented
by the eigenfunction

$$\phi = Ae^{i\alpha x} + Be^{-i\alpha x}, \tag{32}$$

what are the average values of x and of the momentum?

The physical interpretation of the two terms on the right-hand side 
of this equation has been considered previously in Chapter II, so that 
the following remarks merely supplement the previous discussion. 

Let us now apply equation (30) to each term in equation (32). For
the first term

$$x_{Av.} = \frac{\int_{-\infty}^{\infty}\bar{A}e^{-i\alpha x}\,x\,Ae^{i\alpha x}\,dx}{\int_{-\infty}^{\infty}\bar{A}A\,dx} = \frac{\bar{A}A\int_{-\infty}^{\infty}x\,dx}{\bar{A}A\int_{-\infty}^{\infty}dx} = 0,$$

which may also be deduced from the fact that x changes sign in passing
from $x = -\infty$ through x = 0 to $x = +\infty$, and that therefore the
area under the line $y = \bar{A}Ax$ and the axis of x for x = 0 to $x = \infty$
cancels the area for $x = -\infty$ to x = 0.

Similarly, $x_{Av.}$ for $Be^{-i\alpha x}$ is found to be equal to zero. That is, the
function $\phi$ in (32) represents a state in which there are just as many
particles on one side of x = 0 as on the other.

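For the term $e^{i\alpha x}$, the average momentum is given by

$$p_{Av.} = \frac{\int e^{-i\alpha x}\,\dfrac{h}{2\pi i}\,\dfrac{d}{dx}\left(e^{i\alpha x}\right)dx}{\int e^{-i\alpha x}e^{i\alpha x}\,dx} = \frac{h\alpha}{2\pi}.$$

Similarly, for the term $e^{-i\alpha x}$, the average momentum is found to be
$-h\alpha/2\pi$.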

In the case of the function $\phi$, defined by equation (32), the average
momentum is obtained from the relation

$$p_{Av.} = \frac{c_1p_1 + c_2p_2}{c_1 + c_2},$$

where $c_1$ denotes the concentration per unit length of particles with
momentum $p_1$, and $c_2$ the concentration of particles with momentum $p_2$.
In the present case $c_1$ refers to the particles possessing momentum
$h\alpha/(2\pi)$, and $c_2$ refers to those with momentum $-h\alpha/(2\pi)$. The values
of the concentrations are given by

$$c_1 = \bar{A}e^{-i\alpha x}\,Ae^{i\alpha x} = |A|^2; \qquad c_2 = |B|^2.$$

Hence,

$$p_{Av.} = \frac{|A|^2 - |B|^2}{|A|^2 + |B|^2}\cdot\frac{h\alpha}{2\pi}.$$

From this result it is evident that the sign and magnitude of $p_{Av.}$
are governed by the relative magnitudes of $|A|^2$ and $|B|^2$.
As another illustration we shall determine average values of x and
of p for the linear harmonic oscillator. In terms of the normalized
functions,

$$x_{Av.} = \frac{1}{N_n^2}\int_{-\infty}^{\infty}e^{-x^2}H_n^2\,x\,dx.$$

Now $e^{-x^2}H_n^2$ is always positive, so that the product of this function
with x changes sign at x = 0. Hence the integral vanishes. That is,
there is an equal likelihood for the occurrence of the particle on either
side of the origin.

Again,

$$p_{Av.} = \frac{1}{N_n^2}\cdot\frac{h}{2\pi i}\int_{-\infty}^{\infty}e^{-x^2/2}H_n\,\frac{d}{dq}\left(e^{-x^2/2}H_n\right)dx.$$

But

$$\frac{dx}{dq} = \sqrt{b}.$$

Hence,

$$\frac{d}{dq} = \sqrt{b}\,\frac{d}{dx}.$$

Now

$$\frac{d}{dx}\left(e^{-x^2/2}H_n\right) = e^{-x^2/2}H_n' - x\,e^{-x^2/2}H_n,$$

so that the expression to be integrated is

$$e^{-x^2}\left(H_nH_n' - xH_n^2\right).$$

It is evident that each of the terms in this expression is an odd
function. For, if $H_n(x)$ is a polynomial of the form $\sum a_kx^k$, $H_n'(x)$ must
be of the form $\sum a_kk\,x^{k-1}$. Consequently the product $H_nH_n'$ will consist
of a series of odd powers of x, while $xH_n^2$ must be an odd function for
the reason mentioned in discussing the value of $x_{Av.}$ Therefore $p_{Av.}$
must also be zero.

That is, it is not possible to perform any experiment by which either the
position or velocity of the oscillating particle may be determined at any given
instant.

The argument by which this conclusion was derived was essentially
to the effect that in both cases the expressions to be integrated
contained odd powers of x. It is therefore evident that the average values
of $q^2$ and $p^2$ will not vanish.
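It should be observed in this connection that the operator p refers
to the coordinate of position q, so that

$$p = \frac{h}{2\pi i}\frac{d}{dq} = \frac{h\sqrt{b}}{2\pi i}\frac{d}{dx}$$

and

$$p^2 = -\frac{h^2b}{4\pi^2}\frac{d^2}{dx^2}.$$

By definition of the average value, as given in equations (30) and
(31),

$$(x^2)_{Av.} = \frac{1}{N_n^2}\int_{-\infty}^{\infty}e^{-x^2/2}H_n(x)\,x^2\,e^{-x^2/2}H_n(x)\,dx, \tag{33}$$

$$(p^2)_{Av.} = -\frac{h^2b}{4\pi^2N_n^2}\int_{-\infty}^{\infty}e^{-x^2/2}H_n(x)\,\frac{d^2}{dx^2}\left[e^{-x^2/2}H_n(x)\right]dx. \tag{34}$$

The exact calculation of these integrals in the general case involves
more elaborate methods than those that may be discussed in the present
connection. However, it is possible to carry out the calculation for the
case $H_0(x)$, that is, for the lowest energy state (n = 0), for which the
energy $E_0 = h\nu_0/2$ and $H_0(x) = 1$. Substituting in equations (33) and
(34), we obtain from equation (24) the relations

$$(x^2)_{Av.} = \frac{1}{\sqrt{\pi}}\int_{-\infty}^{\infty}e^{-x^2}x^2\,dx = \frac{1}{2}$$

and

$$(p^2)_{Av.} = -\frac{h^2b}{4\pi^2\sqrt{\pi}}\int_{-\infty}^{\infty}e^{-x^2/2}\,\frac{d^2}{dx^2}\left(e^{-x^2/2}\right)dx = \frac{h^2b}{8\pi^2} = \frac{h\mu\nu_0}{2}. \tag{35}$$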
According to classical mechanics, the kinetic energy of the oscillating
particle is $T = p^2/(2\mu)$, and this varies from T = 0 at the limits of
vibration to $T = E_0$ when passing through q = 0. Consequently,
the average value of T is $E_0/2$, and as will be observed from equation
(35), the average value of $p^2/(2\mu)$ on the basis of wave mechanics is
found to be equal to $E_0/2$, as in the classical case.

Since $(x^2)_{Av.} = 0.5$, it follows that $\sqrt{(x^2)_{Av.}} = \pm\sqrt{0.5} = \pm 0.707$,
where $|x_0| = 1$ represents the maximum amplitude of vibration.
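
The two averages for the state n = 0 may be verified by direct numerical integration. The following Python sketch is an illustration only: the helper `trapezoid` and the names `phi0`, `d2phi0`, `x2_av` and `p2_av` are assumptions made for the example, and the average of $p^2$ is expressed in units of $h^2b/(4\pi^2)$, so that equation (35) corresponds to the value 1/2.

```python
import math


def trapezoid(f, a=-10.0, b=10.0, steps=4000):
    """Trapezoidal rule; the integrands used here are negligible beyond |x| = 10."""
    dx = (b - a) / steps
    total = 0.5 * (f(a) + f(b)) + sum(f(a + i * dx) for i in range(1, steps))
    return total * dx


def phi0(x):
    """Unnormalized eigenfunction for n = 0."""
    return math.exp(-x * x / 2.0)


def d2phi0(x):
    """Second derivative of phi0 with respect to x."""
    return (x * x - 1.0) * math.exp(-x * x / 2.0)


norm = trapezoid(lambda x: phi0(x) ** 2)                    # N_0^2, expected sqrt(pi)
x2_av = trapezoid(lambda x: phi0(x) * x * x * phi0(x)) / norm
p2_av = -trapezoid(lambda x: phi0(x) * d2phi0(x)) / norm    # units of h^2 b / (4 pi^2)
print(norm, x2_av, p2_av)
```

Both averages come out very close to 0.5, in agreement with the values obtained above from equation (24).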

5.4 Probabilities of Transition. 9 In connection with the calculation
of probabilities of transition from one energy state to another, it is
necessary to calculate the integral

$$J_{mn} = \frac{1}{N_mN_n}\int_{-\infty}^{\infty}\phi_m\,x\,\phi_n\,dx, \tag{36}$$

where n and m refer to two different states for which the energies are
$E_n$ and $E_m$ respectively. The integral in (36) is known as a matrix
component integral, because it is of fundamental importance in the type
of quantum mechanics introduced by W. Heisenberg which has been
designated as matrix calculus.
In terms of Hermitian polynomials

$$J_N = J_{mn}N_mN_n = \int_{-\infty}^{\infty}e^{-x^2}H_n(x)\,H_m(x)\,x\,dx. \tag{37}$$

Since

$$x\,e^{-x^2} = -\frac{1}{2}\frac{d(e^{-x^2})}{dx},$$

we can, by repeated application of the standard formula for integration
by parts and cancelation of identical terms of opposite sign, write

$$J_N = -\frac{1}{2}\Big[e^{-x^2}H_nH_m\Big] + \frac{1}{2}\int e^{-x^2}\left(H_m'H_n + H_mH_n'\right)dx.$$

(For convenience we shall omit the limits of integration in writing the
integral.)

Since $e^{-x^2/2}H_n(x)$ vanishes at $x = \pm\infty$, the first term on the right-hand
side disappears. Also, according to (19), we may replace $H_m'$ and $H_n'$,
respectively, by the expressions $2mH_{m-1}$ and $2nH_{n-1}$. Hence we
derive the relation

$$J_N = m\int e^{-x^2}H_nH_{m-1}\,dx + n\int e^{-x^2}H_mH_{n-1}\,dx. \tag{38}$$

9 This topic is discussed in Chapter XV. 
Now, as stated previously, the functions $\phi_n = e^{-x^2/2}H_n$ form an
orthogonal system, so that for $n \neq k$,

$$\int e^{-x^2}H_nH_k\,dx = 0.$$

Therefore, the integrals in (38) will each be equal to zero, unless

n = m - 1, that is m = n + 1,
or

m = n - 1.

In other words, $J_{mn}$ has a definite value only if m differs from n by
+1 or -1. The physical interpretation of this conclusion is that in the
case of the linear harmonic oscillator the only possible transitions are
those between adjacent states, for which $\Delta n = \pm 1$, and therefore

$$\nu = \frac{E_{n+1} - E_n}{h} = \nu_0.$$

Let us consider the case m = n + 1. Then

$$J_N = (n+1)\int e^{-x^2}H_n^2\,dx = (n+1)N_n^2.$$

Using equation (25), we obtain the result

$$J_{n+1,n} = \frac{(n+1)N_n}{N_{n+1}} = \sqrt{\frac{n+1}{2}}. \tag{39}$$

Similarly, it follows that

$$J_{n-1,n} = \frac{nN_{n-1}}{N_n} = \sqrt{\frac{n}{2}}. \tag{40}$$
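
The selection rule and the values (39) and (40) can be confirmed by evaluating the integral (36) numerically. The sketch below is an illustration under stated assumptions: the function names are hypothetical, $H_n(x)$ is evaluated from recurrence (20), the normalizing factors are taken from equation (25), and the integration range is truncated at |x| = 10, where the integrand is negligible for the small values of n used here.

```python
import math


def hermite_value(n, x):
    """Evaluate H_n(x) from H_0 = 1, H_1 = 2x and recurrence (20)."""
    h_prev, h_curr = 1.0, 2.0 * x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h_curr = h_curr, 2.0 * x * h_curr - 2.0 * k * h_prev
    return h_curr


def norm_sq(n):
    """N_n^2 = 2^n n! sqrt(pi), equation (25)."""
    return 2.0 ** n * math.factorial(n) * math.sqrt(math.pi)


def matrix_component(m, n, half_width=10.0, steps=4000):
    """J_mn of equation (36), evaluated by the trapezoidal rule."""
    dx = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        x = -half_width + i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(-x * x) * hermite_value(m, x) * hermite_value(n, x) * x
    return total * dx / math.sqrt(norm_sq(m) * norm_sq(n))


print(matrix_component(3, 2), math.sqrt(3.0 / 2.0))   # m = n + 1, equation (39)
print(matrix_component(1, 2), math.sqrt(2.0 / 2.0))   # m = n - 1, equation (40)
print(matrix_component(4, 2))                          # m = n + 2, expected to vanish
```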



SUPPLEMENTARY NOTE 1 

PROOF THAT SOLUTIONS OF THE SCHROEDINGER EQUATION FOR 
ONE DEGREE OF FREEDOM ARE ORTHOGONAL 

For the case in which the S. equation is expressed in terms of a single 
coordinate variable, the proof that the solutions form an orthogonal 
system is as follows. 

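Let us consider the S. equation of the form

$$\frac{d^2\psi}{dx^2} + k^2(E - V)\psi = 0, \tag{i}$$

where $k^2 = 8\pi^2\mu/h^2$.

Let $\psi_m$ and $\psi_n$ denote any two solutions valid for the range $x = -\infty$
to $x = +\infty$, and let $E_m$ and $E_n$ denote the corresponding eigenvalues,
which are not identical. In the general case $\psi_m$ and $\psi_n$ will be complex
functions, so that in order to obtain a real magnitude we must take
the product $\psi_m\bar{\psi}_m$ or $\bar{\psi}_n\psi_n$.

In consequence of (i), we have the two relations

$$\frac{d^2\psi_m}{dx^2} + k^2(E_m - V)\psi_m = 0, \tag{ii}$$

$$\frac{d^2\bar{\psi}_n}{dx^2} + k^2(E_n - V)\bar{\psi}_n = 0. \tag{iii}$$

If we multiply the first of these equations by $\bar{\psi}_n$ and the second by
$\psi_m$, and integrate between the limits $x = \pm\infty$, we obtain the relations

$$\int\bar{\psi}_n\psi_m''\,dx + k^2\int(E_m - V)\,\psi_m\bar{\psi}_n\,dx = 0, \tag{iv}$$

$$\int\psi_m\bar{\psi}_n''\,dx + k^2\int(E_n - V)\,\psi_m\bar{\psi}_n\,dx = 0, \tag{v}$$

where $\psi_m'' = d^2\psi_m/dx^2$, and similarly for $\bar{\psi}_n''$.
Subtracting (iv) from (v), the term involving V cancels out, so that

$$\int\left(\psi_m\bar{\psi}_n'' - \bar{\psi}_n\psi_m''\right)dx + k^2(E_n - E_m)\int\psi_m\bar{\psi}_n\,dx = 0. \tag{vi}$$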
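Now

$$\frac{d}{dx}\left(\psi_m\bar{\psi}_n'\right) = \psi_m\bar{\psi}_n'' + \psi_m'\bar{\psi}_n'.$$

Also

$$\frac{d}{dx}\left(\bar{\psi}_n\psi_m'\right) = \bar{\psi}_n\psi_m'' + \bar{\psi}_n'\psi_m'.$$

Hence,

$$\int_{-\infty}^{\infty}\left(\psi_m\bar{\psi}_n'' - \bar{\psi}_n\psi_m''\right)dx = \Big[\psi_m\bar{\psi}_n' - \bar{\psi}_n\psi_m'\Big]_{-\infty}^{\infty} = 0,$$

since $\psi_m$ and $\bar{\psi}_n$, as well as their differential coefficients, all tend to
become equal to zero as the limits are approached.
Consequently, equation (vi) reduces to the relation

$$k^2(E_n - E_m)\int_{-\infty}^{\infty}\psi_m\bar{\psi}_n\,dx = 0. \tag{vii}$$

Since $k^2$ is a finite quantity, and $E_n$ is not equal to $E_m$, it follows that

$$\int_{-\infty}^{\infty}\psi_m\bar{\psi}_n\,dx = 0. \tag{viii}$$

That is, any two eigenfunctions corresponding to two different energy
states are orthogonal.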

The use of the designation "orthogonal" is due to the analogy between
relation (viii) and that which expresses the condition that two
vectors shall be at right angles. Thus we can represent a momentum
$p = \mu v$, in both magnitude and direction, by a straight line of length
p (a scalar magnitude) which is drawn with respect to the three rectangular
axes of coordinates in the direction of motion. The line thus
drawn constitutes a vector. Let $\theta_x$, $\theta_y$, $\theta_z$ be the angles between this
line and the three axes of coordinates. Then the components of momentum
along the three axes are given by

$$p_x = p\cos\theta_x, \qquad p_y = p\cos\theta_y, \qquad p_z = p\cos\theta_z,$$

so that

$$p^2 = p_x^2 + p_y^2 + p_z^2.$$

Now suppose we have another vector P with components $P_x = P\cos\eta_x$,
etc., where $\eta_x$ is the angle between the vector P and the axis of x.
Let $\phi$ denote the angle between the two vectors p and P. Then,
it follows that

$$\cos\phi = \cos\theta_x\cos\eta_x + \cos\theta_y\cos\eta_y + \cos\theta_z\cos\eta_z.$$

Consequently,

$$pP\cos\phi = p_xP_x + p_yP_y + p_zP_z.$$

The product on the left-hand side is a scalar magnitude. In vector
notation this is written $\mathbf{p}\cdot\mathbf{P}$ and is known as the "dot product" of
the vectors.

In order that the two vectors shall be at right angles, it is necessary
that $\phi = \pi/2$, and that $\cos\phi = 0$. Hence, the condition for the
orthogonality of the two vectors is given by the relation

$$p_xP_x + p_yP_y + p_zP_z = 0.$$

If now we assume a space of, say, n dimensions and consider two
vectors in this space, each with n components along the n axes of
coordinates, the last equation becomes

$$\sum_{k=1}^{n}p_kP_k = 0,$$

where k has all values ranging from 1 to n, and $\sum$ denotes the sum of
all these products. By extending this argument to a space of an infinite
number of dimensions, and regarding $\psi_m$ and $\bar{\psi}_n$ as vectors in
this space, equation (viii) follows logically.



SUPPLEMENTARY NOTE 2 
THE GAUSS ERROR FUNCTION 

The curve for $\phi_0(x) = e^{-x^2/2}$ is essentially the same as the Gauss error
curve, which is ordinarily expressed in the form 10

$$y = \frac{h}{\sqrt{\pi}}\,e^{-h^2x^2}.$$

In this expression h corresponds to the absolute measure of precision,
and x is the magnitude of the deviation from the mean value. Thus
y is a measure of the probability of occurrence of a deviation of
magnitude x.
The probability integral is given by

$$P = \frac{2h}{\sqrt{\pi}}\int_0^xe^{-h^2x^2}\,dx.$$

For h = 1, the integral is known as Gauss's Error Integral, or the
"erf" function, and is designated by

$$\Phi(x) = \frac{2}{\sqrt{\pi}}\int_0^xe^{-x^2}\,dx.$$

The following table gives values of this integral for different values of
x, as taken from L. Silberstein's "Mathematical Tables."

   x        Φ(x)       1 - Φ(x)

  0.0      0.0000      1.0000
  0.25     0.2763      0.7237
  0.5      0.5205      0.4795
  0.75     0.7112      0.2888
  1.0      0.8427      0.1573
  1.5      0.9661      0.0339
  3.0      0.99998     0.00002
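
The tabulated values can be regenerated with the error function provided in the Python standard library (`math.erf` and `math.erfc`). The function name `error_function_table` is an illustrative choice.

```python
import math


def error_function_table(xs=(0.0, 0.25, 0.5, 0.75, 1.0, 1.5, 3.0)):
    """Rows of (x, erf(x), 1 - erf(x)) for the values of x tabulated above."""
    return [(x, math.erf(x), math.erfc(x)) for x in xs]


for x, phi, complement in error_function_table():
    print(f"{x:5.2f}  {phi:.5f}  {complement:.5f}")
```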

COLLATERAL READING 

The problem of the linear harmonic oscillator is discussed in all the treatises 
on quantum mechanics (see list in Appendix I), but the reader will find it of special 
assistance to consult the following: 

1. CONDON, E. U., and MORSE, P. M., "Quantum Mechanics," pp. 47-52. 

2. PAULING, L., and WILSON, E. B., Jr., "Introduction to Quantum Mechanics," 

pp. 67-82. 

3. DARROW, K. K., Bell System Tech. J., 6, 653 (1927).

4. SOMMERFELD, A., "Wave Mechanics," Chapter I. 

10 See, for instance, Mellor's "Higher Mathematics." 




CHAPTER VI 
THE RIGID ROTATOR 

6.1 The Schroedinger Equation in Three Dimensions. In order 
to discuss such problems as that of the rotational energy states of a 
diatomic molecule, or the energy states of a hydrogen-like atom, it is 
necessary to formulate the S. equation in three dimensions. As for 
the equation in one dimension, we employ as a starting point the equa- 
tion for the propagation of a wave motion, which with rectangular axes 
has the form 



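$$\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2} = \nabla^2\psi = \frac{1}{u^2}\frac{\partial^2\psi}{\partial t^2}, \tag{1}$$

where $\psi$ is the amplitude, $\nabla^2$ is designated the Laplacian differential
operator, and u is the velocity of propagation of the wave. This is a partial
differential equation of the second order, and the complete integral
must define $\psi$ as a function of the three coordinate variables and of the
time t.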
As in solving equation (#.43), we assume a solution of the form 

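$$\psi(x, y, z, t) = \phi(x, y, z)\,e^{-2\pi i\nu t},$$
where $\nu\lambda = u$, and consequently

$$\frac{\partial^2\psi}{\partial t^2} = -4\pi^2\nu^2\,\phi\,e^{-2\pi i\nu t}.$$

Hence,

$$\frac{1}{u^2}\frac{\partial^2\psi}{\partial t^2} = -\frac{4\pi^2}{\lambda^2}\,\phi\,e^{-2\pi i\nu t}.$$

Since

$$\frac{\partial^2\psi}{\partial x^2} = e^{-2\pi i\nu t}\,\frac{\partial^2\phi}{\partial x^2},$$

and similar relations apply for $\partial^2\psi/\partial y^2$ and $\partial^2\psi/\partial z^2$, it is evident that
the function $\phi$ in the solution of equation (1) satisfies the partial
differential equation

$$\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial y^2} + \frac{\partial^2\phi}{\partial z^2} + \frac{4\pi^2}{\lambda^2}\,\phi = 0, \tag{2a}$$

or

$$\nabla^2\phi + \frac{4\pi^2}{\lambda^2}\,\phi = 0. \tag{2b}$$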

A 

By substituting in the last equation de Broglie's relation for $\lambda$, we
would obtain the corresponding S. equation in terms of rectangular
coordinates. However, in problems involving rotation about an axis of
symmetry, or motion of a particle in a central orbit (e.g., the motion of
an electron about a positively charged nucleus), it is much more convenient
to express the Laplacian operator in spherical coordinates. This
type of coordinate system is illustrated in Fig. 22a, Chapter IV, and
the relations between the coordinates r, $\theta$, $\eta$ and the rectangular
coordinates were given in the equations in (4-19).

The object of such a transformation of coordinates is, as will be shown 
by the subsequent argument, to express the S. equation in such a form 
as will make it possible to separate the partial differential equation into 
three ordinary differential equations, each corresponding to one of the 
three generalized coordinates. The method is thus analogous to that 
used in Chapter IV for the solution of the problem of the hydrogen-like 
atom in terms of spherical coordinates. 

6.2 Rules for Transformation of Coordinates. In Chapter IV,
section 3, it was mentioned that spherical polar coordinates constitute
a special case of a more general class known as orthogonal curvilinear
systems of coordinates. 1 It has been demonstrated that the
three-dimensional S. equation can be separated only in those cases in which
it is expressed in terms of such generalized coordinates. Designating
these by $q_1$, $q_2$, and $q_3$, the element of distance ds is given, according
to equation (4-22), by a relation of the form

$$(ds)^2 = a_1(dq_1)^2 + a_2(dq_2)^2 + a_3(dq_3)^2,$$

where each of the coefficients $a_1$, $a_2$, and $a_3$ is in general a function
of $q_1$, $q_2$, and $q_3$.

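By the application of the theory of vectors 2 or by the application
of the methods of the calculus of variations, 3 it is shown that in terms
of the generalized coordinates

$$\nabla^2\phi = \frac{1}{\sqrt{a_1a_2a_3}}\left[\frac{\partial}{\partial q_1}\left(\frac{\sqrt{a_1a_2a_3}}{a_1}\frac{\partial\phi}{\partial q_1}\right) + \frac{\partial}{\partial q_2}\left(\frac{\sqrt{a_1a_2a_3}}{a_2}\frac{\partial\phi}{\partial q_2}\right) + \frac{\partial}{\partial q_3}\left(\frac{\sqrt{a_1a_2a_3}}{a_3}\frac{\partial\phi}{\partial q_3}\right)\right], \tag{3}$$

and that the element of volume $d\tau$ is given by

$$d\tau = dx\,dy\,dz = \sqrt{a_1a_2a_3}\;dq_1\,dq_2\,dq_3,$$

where the expression $\sqrt{a_1a_2a_3}$ is known as the discriminant.

It will be observed that since $a_1$, $a_2$, and $a_3$ may each be functions
of $q_1$, $q_2$, and $q_3$, the order of operations is important. That is,

$$\frac{\partial}{\partial q_1}\left(\frac{\sqrt{a_1a_2a_3}}{a_1}\frac{\partial\phi}{\partial q_1}\right) \quad\text{is not identical with}\quad \frac{\sqrt{a_1a_2a_3}}{a_1}\frac{\partial^2\phi}{\partial q_1^2}.$$

The operators are non-commutative, and in deriving the S. equation this
is of extreme significance.

1 See Pauling and Wilson, "Introduction to Quantum Mechanics," pp. 104 and
443, for further discussion.
2 See Appendix IV, Section 13.
3 Courant and Hilbert, p. 194.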

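Now in the case of spherical coordinates, as stated in Chapter IV,
we have the following relations

$$q_1 = r; \qquad q_2 = \theta; \qquad q_3 = \eta,$$
and
$$a_1 = 1; \qquad a_2 = r^2; \qquad a_3 = r^2\sin^2\theta.$$

Hence

$$\sqrt{a_1a_2a_3} = r^2\sin\theta,$$
and therefore

$$d\tau = r^2\sin\theta\,d\theta\,d\eta\,dr.$$

Consequently equation (3) assumes the form

$$\nabla^2\phi = \frac{1}{r^2\sin\theta}\left\{\frac{\partial}{\partial r}\left(r^2\sin\theta\,\frac{\partial\phi}{\partial r}\right) + \frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial\phi}{\partial\theta}\right) + \frac{\partial}{\partial\eta}\left(\frac{1}{\sin\theta}\frac{\partial\phi}{\partial\eta}\right)\right\}. \tag{4}$$

It is evident that

$$\frac{\partial}{\partial r}\left(r^2\sin\theta\,\frac{\partial\phi}{\partial r}\right) = \sin\theta\,\frac{\partial}{\partial r}\left(r^2\frac{\partial\phi}{\partial r}\right),$$

while

$$\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial\phi}{\partial\theta}\right) = \cos\theta\,\frac{\partial\phi}{\partial\theta} + \sin\theta\,\frac{\partial^2\phi}{\partial\theta^2},$$

and

$$\frac{\partial}{\partial\eta}\left(\frac{1}{\sin\theta}\frac{\partial\phi}{\partial\eta}\right) = \frac{1}{\sin\theta}\frac{\partial^2\phi}{\partial\eta^2},$$

where $\phi = \phi(r, \theta, \eta)$.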

Thus, for periodic motions of a particle in three dimensions, equation
(2b) with the value of the Laplacian operator given in equation (4)
takes the place, in quantum mechanics, of the energy equation used
in ordinary mechanics, which is of the form

$$E = \tfrac{1}{2}\mu\left(\dot{r}^2 + r^2\dot{\theta}^2 + r^2\sin^2\theta\,\dot{\eta}^2\right) + V(r, \theta, \eta),$$

where the first term, as indicated in equation (4-37), expresses the kinetic
energy, and the second term, the potential energy as a function of the
coordinate variables.
This equation is customarily written in the Hamiltonian form

$$E = \frac{1}{2\mu}\left(p_r^2 + \frac{p_\theta^2}{r^2} + \frac{p_\eta^2}{r^2\sin^2\theta}\right) + V(r, \theta, \eta), \tag{5}$$

and is the form which equation (4-88) assumes for the general case.

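Under certain conditions equation (4) assumes simpler forms. Thus,
for constant value of $r$ (rotation of a sphere about an axis), $\partial/\partial r = 0$,
and the Laplacian term becomes

$$\nabla^2 = \frac{1}{r^2}\left\{\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\eta^2}\right\}. \tag{6}$$

For the case of plane polar coordinates $(r, \eta)$ the Laplacian may be
derived from the relations

$$x = r\cos\eta,\qquad y = r\sin\eta,$$

as follows:⁴

Since $r^2 = x^2 + y^2$, and $\tan\eta = y/x$, therefore,

$$\frac{\partial r}{\partial x} = \frac{x}{r},\qquad \frac{\partial r}{\partial y} = \frac{y}{r},\qquad \frac{\partial\eta}{\partial x} = -\frac{y}{r^2},\qquad \frac{\partial\eta}{\partial y} = \frac{x}{r^2}.$$

4 Slater and Frank, "Introduction to Theoretical Physics," pp. 164-165.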
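Now,

$$\frac{\partial\phi}{\partial x} = \frac{\partial\phi}{\partial r}\cdot\frac{\partial r}{\partial x} + \frac{\partial\phi}{\partial\eta}\cdot\frac{\partial\eta}{\partial x}.$$

Consequently,

$$\frac{\partial^2\phi}{\partial x^2} = \frac{\partial^2\phi}{\partial r^2}\left(\frac{\partial r}{\partial x}\right)^2 + 2\,\frac{\partial^2\phi}{\partial r\,\partial\eta}\left(\frac{\partial r}{\partial x}\cdot\frac{\partial\eta}{\partial x}\right) + \frac{\partial^2\phi}{\partial\eta^2}\left(\frac{\partial\eta}{\partial x}\right)^2 + \frac{\partial\phi}{\partial r}\cdot\frac{\partial^2 r}{\partial x^2} + \frac{\partial\phi}{\partial\eta}\cdot\frac{\partial^2\eta}{\partial x^2},$$

and a similar equation will be obtained for $\partial^2\phi/\partial y^2$. Adding these two equations, we have

$$\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial y^2} = \frac{\partial^2\phi}{\partial r^2}\left[\left(\frac{\partial r}{\partial x}\right)^2 + \left(\frac{\partial r}{\partial y}\right)^2\right] + 2\,\frac{\partial^2\phi}{\partial r\,\partial\eta}\left(\frac{\partial r}{\partial x}\cdot\frac{\partial\eta}{\partial x} + \frac{\partial r}{\partial y}\cdot\frac{\partial\eta}{\partial y}\right) + \frac{\partial^2\phi}{\partial\eta^2}\left[\left(\frac{\partial\eta}{\partial x}\right)^2 + \left(\frac{\partial\eta}{\partial y}\right)^2\right] + \frac{\partial\phi}{\partial r}\left(\frac{\partial^2 r}{\partial x^2} + \frac{\partial^2 r}{\partial y^2}\right) + \frac{\partial\phi}{\partial\eta}\left(\frac{\partial^2\eta}{\partial x^2} + \frac{\partial^2\eta}{\partial y^2}\right).$$

Substituting from the first set of equations, the result is

$$\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}\right)\phi = \left(\frac{\partial^2}{\partial r^2} + \frac{1}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\frac{\partial^2}{\partial\eta^2}\right)\phi.$$

It will be observed that this result could also have been obtained
directly from the rule of transformation in equation (3) thus:
Since $(ds)^2 = (dr)^2 + r^2(d\eta)^2$,

$$a_1 = 1,\quad a_2 = r^2;\quad \sqrt{a_1a_2} = r.$$

Therefore,

$$\nabla^2\phi = \frac{1}{r}\left\{\frac{\partial}{\partial r}\left(r\,\frac{\partial\phi}{\partial r}\right) + \frac{\partial}{\partial\eta}\left(\frac{1}{r}\frac{\partial\phi}{\partial\eta}\right)\right\} = \frac{\partial^2\phi}{\partial r^2} + \frac{1}{r}\frac{\partial\phi}{\partial r} + \frac{1}{r^2}\frac{\partial^2\phi}{\partial\eta^2}.$$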
In the case of radial motion, for which $\partial/\partial\theta = \partial/\partial\eta = 0$, the Laplacian
operator evidently assumes the form

$$\nabla^2 = \frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{d}{dr}\right) = \frac{d^2}{dr^2} + \frac{2}{r}\frac{d}{dr}.$$
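As a quick check on the plane polar form of the Laplacian, $\partial^2/\partial r^2 + (1/r)\,\partial/\partial r + (1/r^2)\,\partial^2/\partial\eta^2$, we may apply it to a function whose Cartesian Laplacian is known. The choice $\phi = x^2 - y^2 = r^2\cos 2\eta$ is ours, made only for illustration.

```latex
\begin{align*}
\phi &= r^2\cos 2\eta \quad (= x^2 - y^2), \\
\frac{\partial^2\phi}{\partial r^2} &= 2\cos 2\eta, \qquad
\frac{1}{r}\frac{\partial\phi}{\partial r} = 2\cos 2\eta, \qquad
\frac{1}{r^2}\frac{\partial^2\phi}{\partial\eta^2} = -4\cos 2\eta, \\
\nabla^2\phi &= (2 + 2 - 4)\cos 2\eta = 0, \\
\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial y^2} &= 2 - 2 = 0 .
\end{align*}
```

Both forms give zero, as they must for this function.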



We shall now return to the consideration of equation (2b). If we
express $\nabla^2$ in this equation in terms of spherical polar coordinates
and substitute for $\lambda$ the de Broglie relation

$$\lambda = \frac{h}{\sqrt{2\mu(E - U)}},$$

we obtain the S. equation of the form

$$\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial\phi}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial\phi}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2\phi}{\partial\eta^2} + \frac{8\pi^2\mu}{h^2}(E - U)\,\phi = 0, \tag{8}$$

where $U$, the potential energy, is a function of $r$, $\theta$, and $\eta$. As in the
case of the S. equation for one coordinate variable, we seek solutions of
equation (8) that will be physically rational. Thus $\phi$ must not become
infinite at any point in space, and it must tend to vanish as those regions
are approached in which the probability of occurrence tends to become
zero. The exact form of these "boundary conditions" must depend
upon the nature of the particular problem. Thus, in the case of a
hydrogen-like atom, the probability of occurrence of the electron must
decrease continuously to zero as $r$ tends toward infinitely large values.
We shall find that actually this probability becomes infinitesimally
small for values of $r$ exceeding only a few atomic radii. In the case of
the angle variables, the limits are $0 \le \theta \le \pi$ and $0 \le \eta \le 2\pi$, and the
distribution function, as we shall designate $\phi$, must exhibit a periodicity
with respect to these variables. That is, $\phi(\theta, \eta) = \phi(\theta \pm \pi, \eta \pm 2\pi)$.


Furthermore, because experimental observations show that any
atomic or molecular system can exist only in a series of discrete states
defined by the energy values $E_1$, $E_2$, . . . and so forth, we must expect,
if the solutions of the S. equation correspond to the observations, that
"sensible" solutions of equation (8) will exist only for a series of
discrete values of the energy $E$, which will constitute the eigenvalues.
The corresponding eigenfunctions $\phi$ will represent, in the most general
case, amplitudes of stationary de Broglie waves in three dimensions,
and cannot therefore be visualized physically. In the case of constant
values of $r$, $\phi$ represents the amplitude of vibration of a spherical surface,
and hence the functions are known as surface spherical harmonics.
They are represented by expressions which are functions of the latitude
($\theta$) and longitude ($\eta$) and which exhibit nodes and loops along both
meridian circles and zonal circles (parallel to the equatorial plane).
Consequently, the mathematical expressions are quite complicated and,
in fact, appear formidable at first glance.

To some, indeed, it might appear that the mathematician has endowed
nature with a complexity far beyond its needs. Yet the only
reply to such an accusation must be that the "simple" solutions,
those which are relatively easy to understand (because they involve
no "higher mathematics"), do not correspond to the facts. Nature
is complex in its fundamental elements, and the only feature that is
astounding is this: that human intelligence has been able to devise a
method of reasoning with symbols by which a one-to-one correspondence
is attained between the deductions from this reasoning and the
experimental facts. This, to the mind of the writer, has always appeared
the most marvelous aspect of all mathematical technic in dealing
with the interpretation of nature. And because it is stimulating to
understand this "picture"; because it must add a certain measure
of pleasure to perceive, even though it be dimly at first, the results
attained by combining transcendental imagination with the most
exacting type of logic; because of those rewards which the effort
holds forth, let the reader not be discouraged too readily. Patience
and persistence alone will accomplish wonders, even in the comprehension
of a symbolic mathematical technic.

6.3 The Rigid Rotator with Fixed Axis. Let us consider the problem
of a diatomic molecule constituted of two atoms of masses $\mu_1$ and $\mu_2$,
located at distances $r_1$ and $r_2$ from the axis of rotation. We shall
assume that the molecule is of the "dumbbell" form, so that the distance
between the centers of the atoms ($r_1 + r_2 = r_0$) is fixed, and therefore
we neglect the possibility that the atoms will vibrate along this
axis in virtue of their mutual attractive and repulsive forces. (Of
course, such vibrations, of frequency $n\nu_0$, actually occur and give rise
to vibrational energy states, a problem which was considered in the
case of the harmonic oscillator.) Under these conditions we may
regard the molecule as possessing, in general, two degrees of freedom or
mobility. The molecule will have a rotational motion about an axis of
symmetry passing through its center of gravity. This will be represented
by an angular velocity $\dot\eta = d\eta/dt$ in the plane YOX (see Fig. 22a).
Also there will be a precessional motion of the fixed axis of the molecule
about the axis of symmetry, which is represented by the angular velocity
$\dot\theta$. Since there is no potential energy term, the total energy is all kinetic
and is given by

$$E = \tfrac{1}{2}\left(\mu_1r_1^2 + \mu_2r_2^2\right)\left(\dot\theta^2 + \sin^2\theta\cdot\dot\eta^2\right). \tag{9}$$
Since the molecule is rotating about its center of gravity,

$$\mu_1r_1 = \mu_2r_2.$$

Hence, if we put

$$\mu = \frac{\mu_1\mu_2}{\mu_1 + \mu_2},$$

then

$$\mu_1r_1^2 + \mu_2r_2^2 = \frac{\mu_1\mu_2}{\mu_1 + \mu_2}\,r_0^2 = \mu r_0^2 = I,$$

and we can write (9) in the form

$$E = \frac{I}{2}\left(\dot\theta^2 + \sin^2\theta\cdot\dot\eta^2\right), \tag{10}$$

where $I$ = moment of inertia of the molecule about its center of gravity,
$r_0$ = mean radius of gyration, and $1/\mu = 1/\mu_1 + 1/\mu_2$, where $\mu$ is known
as the "reduced" mass.

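Thus $r_0^2(\dot\theta^2 + \sin^2\theta\cdot\dot\eta^2) = v^2$, where $v$ is the velocity of rotation, and
the corresponding de Broglie wave length is given by

$$\lambda = \frac{h}{\mu v} = \frac{h\,r_0}{\sqrt{2EI}}.$$

Hence, the S. equation

$$\nabla^2\phi + \frac{4\pi^2}{\lambda^2}\,\phi = 0$$

becomes

$$\nabla^2\phi + \frac{8\pi^2EI}{h^2r_0^2}\,\phi = 0.$$

Since $I$ is a constant for any given diatomic molecule in the state
for which the energy is $E$, it follows that both $v$ and $\lambda$ are constants.

Furthermore, it follows that we may use the form of the Laplacian
operator given in (6) with $r^2 = r_0^2$. Hence, multiplying both terms by
$r_0^2$, the equation to be solved is

$$\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial\phi}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2\phi}{\partial\eta^2} + \frac{8\pi^2IE}{h^2}\,\phi = 0. \tag{11}$$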

There are two cases which may occur. In the first of these, the molecule
is free to revolve only about an axis at right angles to the axis of the
molecule. This is known as the case of the rigid rotator with fixed axis.
In the second case, the molecule may exhibit a motion of precession, as
well as that of rotation. The latter is the case for which equation (11)
applies and will be discussed in a following section. In the first case,
however, $\partial/\partial\theta = 0$, $\sin\theta = 1$, and the equation reduces to the ordinary
differential equation

$$\frac{d^2\phi}{d\eta^2} + m^2\phi = 0, \tag{12}$$

where $m^2 = 8\pi^2EI/h^2$ is used to indicate that the coefficient is always
positive.
The solution of this equation has been discussed previously. It is

$$\phi = Ae^{im\eta} + Be^{-im\eta}, \tag{13}$$

which may be written in the form of a sine or cosine function. Thus
we may use the form

$$\phi = C\sin(m\eta + \delta),$$

where $\delta$ is a phase angle.

Now this equation has physical significance only if $m$ = 0, 1, 2, etc.,
since $\phi$ must return to the same value when $\eta$ is increased by $2\pi$.
Consequently, the eigenvalues of the discrete energy states are given
by the relation

$$E_m = \frac{m^2h^2}{8\pi^2I}, \tag{14}$$

where $m$ is an arbitrary integer.

Now from equations (10) and (14) it follows, for the case $\partial/\partial\theta = 0$,
that

$$\frac{I}{2}\,\dot\eta^2 = \frac{m^2h^2}{8\pi^2I}.$$

Hence,

$$I\dot\eta = \pm\frac{mh}{2\pi}.$$

That is, the angular momentum of rotation of the molecule is equal to
an integral multiple of $h/2\pi$, and the plus and minus signs refer to opposite
directions of rotation. Thus, the interpretation of $e^{\pm im\eta}$ is analogous
to that of the corresponding complex exponentials in the case of wave
propagation along the $x$-axis. (See Chapter II.)
If we set

$$A^2\int_0^{2\pi} e^{im\eta}\,e^{-im\eta}\,d\eta = 1,$$

it is evident that

$$A = \frac{1}{\sqrt{2\pi}}.$$

Consequently, corresponding to any one eigenvalue $E_m$, we have the
two normalized eigenfunctions,

$$\phi_m = \frac{1}{\sqrt{2\pi}}\,e^{im\eta}, \tag{15a}$$

and

$$\phi_{-m} = \frac{1}{\sqrt{2\pi}}\,e^{-im\eta}. \tag{15b}$$

Obviously, the functions $\phi_n$ and $\phi_m$, $n \neq m$, are orthogonal, since, as
shown in Chapter III,

$$\int_0^{2\pi} e^{i(m-n)\eta}\,d\eta = \int_0^{2\pi}\cos(m-n)\eta\,d\eta + i\int_0^{2\pi}\sin(m-n)\eta\,d\eta = 0.$$
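The normalization and orthogonality of the functions $e^{im\eta}/\sqrt{2\pi}$ are easily confirmed numerically. The following is a minimal sketch; the function name `overlap` and the number of sample points are our own choices, and the equally spaced sum is used because it integrates these periodic exponentials essentially exactly.

```python
import cmath
import math

def overlap(m, n, points=64):
    """Integral over 0..2*pi of conj(phi_m) * phi_n, with phi_m = exp(i*m*eta)/sqrt(2*pi)."""
    step = 2 * math.pi / points
    total = 0j
    for j in range(points):
        eta = j * step
        # conj(phi_m) * phi_n = exp(i*(n - m)*eta) / (2*pi)
        total += cmath.exp(1j * (n - m) * eta) * step
    return total / (2 * math.pi)

print(abs(overlap(3, 3)))    # normalization integral
print(abs(overlap(3, -3)))   # the two eigenfunctions belonging to the same eigenvalue
print(abs(overlap(1, 4)))    # eigenfunctions belonging to different eigenvalues
```

The first value should be unity and the other two zero to within rounding error.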

It will be observed that in this case there are two eigenfunctions, given
by equations (15a) and (15b), corresponding to any given eigenvalue.
We have here an illustration of a condition that is met with frequently
in the solution of problems in quantum mechanics. Such energy states,
for which there is available more than one eigenfunction for any given
eigenvalue, are known as degenerate (German, "entartet"). Physically
this is interpreted, in the present case, as indicating that actually there
are two energy states which have become merged (degenerated) into
what appears to be one state, because the energy is the same irrespective
of the direction of rotation of any molecule with respect to other molecules.
However, if the molecules are placed in a magnetic field, the energy 
will vary (because the molecules possess magnetic moments) with the 
direction of rotation of the molecule. For one direction of rotation 
the energy will be slightly greater, and for the opposite direction slightly 
less, than the value E m which exists in the absence of a magnetic field. 

6.4 The Rigid Rotator with Free Axis. We shall now consider the
case of the rigid rotator with free axis, for which the S. equation is that
given in (11). The solution must represent $\phi$ as $\phi(\theta, \eta)$, that is, as a
function of the two angle variables. To solve the partial differential
equation, we postulate a solution for $\phi$ of the form

$$\phi = X(\theta)\cdot Z(\eta),$$

where $X$ is a function of $\theta$ only, and $Z$ of $\eta$ only. We have an indication
that this is possible from the fact that in the case of the rotator
with fixed axis we have already found a limiting case of this problem.
Evidently,

$$\frac{\partial\phi}{\partial\theta} = Z\,\frac{dX}{d\theta},$$

and

$$\frac{\partial^2\phi}{\partial\eta^2} = X\,\frac{d^2Z}{d\eta^2}.$$
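Substituting for $\phi$ and its derivatives in equation (11) and using the
symbol

$$\alpha^2 = \frac{8\pi^2IE}{h^2}, \tag{16}$$

we obtain the relation

$$\frac{Z}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\,\frac{dX}{d\theta}\right) + \frac{X}{\sin^2\theta}\frac{d^2Z}{d\eta^2} + \alpha^2XZ = 0,$$

where $Z = Z(\eta)$ and $X = X(\theta)$.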

Since $\sin^2\theta/(XZ)$ never becomes infinite, we can multiply through by
this factor and thus obtain the equation

$$\frac{\sin\theta}{X}\cdot\frac{d}{d\theta}\left(\sin\theta\,\frac{dX}{d\theta}\right) + \alpha^2\sin^2\theta = -\frac{1}{Z}\cdot\frac{d^2Z}{d\eta^2}.$$

It will be observed that the left-hand side does not involve $\eta$, and the
right-hand side does not involve $\theta$. Since this relation must be valid
for all possible values of $\theta$ and $\eta$, it follows that each side of the equation
must be equal to a constant, which we shall designate by $m^2$. We thus
obtain the two ordinary differential equations



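$$\frac{d^2Z}{d\eta^2} + m^2Z = 0 \tag{17}$$

and

$$\sin\theta\,\frac{d}{d\theta}\left(\sin\theta\,\frac{dX}{d\theta}\right) + \left(\alpha^2\sin^2\theta - m^2\right)X = 0. \tag{18}$$

The first of these is identical with equation (12). Therefore the
solutions are given by

$$Z_m = \frac{1}{\sqrt{2\pi}}\,e^{\pm im\eta}, \tag{19}$$

where $m$ = 0, 1, 2, etc.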

Now let us consider equation (18), and as a first step in the process
of solving it we change to a variable $x$, such that

$$x = \cos\theta.$$

Therefore,

$$1 - x^2 = \sin^2\theta,$$

and

$$\frac{d}{d\theta} = -\sin\theta\,\frac{d}{dx}.$$

Before carrying through the transformation to the new variable, we
may divide through by $\sin^2\theta$. This gives

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\,\frac{dX}{d\theta}\right) + \left(\alpha^2 - \frac{m^2}{\sin^2\theta}\right)X = 0.$$

Introducing the variable $x$, this becomes

$$\frac{d}{dx}\left\{(1 - x^2)\frac{dX}{dx}\right\} + \left\{\alpha^2 - \frac{m^2}{1 - x^2}\right\}X = 0. \tag{20}$$

This equation is one of the most important in mathematical physics
and is known as Legendre's equation of order $m$, in $x$, where $-1 \le x \le 1$.
That is, the equation has physical significance between the limits
$x = \cos\theta = \pm1$. These limits constitute so-called singular points,
since $1 - x^2 = 0$ at these points.

Since $m$ can have any integral value, including 0, we shall consider
first the solution of equation (20) for the case $m = 0$, that is, the
Legendre equation of order zero,

$$\frac{d}{dx}\left\{(1 - x^2)\frac{dX}{dx}\right\} + \alpha^2X = 0,$$

or

$$(1 - x^2)\frac{d^2X}{dx^2} - 2x\,\frac{dX}{dx} + \alpha^2X = 0. \tag{21}$$

Let us assume, as in equation (5.13) for the linear oscillator, that $X$
may be represented as a polynomial of degree $k$, so that

$$X = \sum a_kx^k, \tag{22}$$

where $k$ = 0, 1, 2, . . . ($k - 1$), $k$.

Then

$$-2x\,\frac{dX}{dx} = -2\sum a_k\cdot k\cdot x^k.$$

Also, in terms of coefficients of $x^k$,

$$\frac{d^2X}{dx^2} = \sum a_{k+2}(k + 2)(k + 1)\,x^k,$$

and

$$-x^2\,\frac{d^2X}{dx^2} = -\sum a_k\,k(k - 1)\,x^k.$$

Therefore, the coefficient of $x^k$ in (21) is given by

$$a_{k+2}(k + 2)(k + 1) - a_k\{k(k - 1) + 2k - \alpha^2\}.$$

Since equation (21) is valid for all values of $x$ (in the range
$-1 \le x \le 1$), it follows that the coefficient of each power of $x$ must
vanish identically. Hence,

$$a_{k+2} = \frac{a_k\{k(k + 1) - \alpha^2\}}{(k + 2)(k + 1)}.$$
Thus

$$\frac{a_{k+2}}{a_k} = \frac{k(k + 1) - \alpha^2}{(k + 1)(k + 2)}. \tag{23}$$

If $k$ can increase beyond limit, $a_{k+2}/a_k$ approaches 1 for very large values of $k$,
and consequently, if the series for $X$ defined by equation (22) is to converge
for $x = 1$, it must have a finite number of terms and the highest
power of $x$ is given by $k$, where $a_{k+2} = 0$. Consequently,

$$k(k + 1) = \alpha^2. \tag{24}$$

Substituting for $\alpha^2$ from equation (16) it follows that $E$ can assume
only the series of discrete values given by the relation

$$E_k = \frac{k(k + 1)\,h^2}{8\pi^2I}, \tag{25}$$

where $k$ = 0, 1, 2, etc.
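The termination argument can be illustrated with exact rational arithmetic. The sketch below generates the even-power coefficients from the recursion $a_{j+2} = a_j\{j(j+1) - \alpha^2\}/\{(j+1)(j+2)\}$ starting from $a_0 = 1$; the running index $j$, the function name, and the two trial values of $\alpha^2$ are our own illustrative choices.

```python
from fractions import Fraction

def even_series(alpha_sq, terms=8):
    """Coefficients a_0, a_2, a_4, ... generated by the recursion, with a_0 = 1."""
    coeffs = [Fraction(1)]
    for n in range(terms - 1):
        j = 2 * n  # power of x belonging to the last coefficient found
        coeffs.append(coeffs[-1] * (j * (j + 1) - alpha_sq) / ((j + 1) * (j + 2)))
    return coeffs

# alpha^2 = 6 = 2(2 + 1): the series breaks off after the x^2 term.
print(even_series(6))
# alpha^2 = 5 is not of the form k(k + 1): no coefficient ever vanishes.
print(even_series(5))
```

For $\alpha^2 = 6$ the surviving polynomial is $1 - 3x^2$; for $\alpha^2 = 5$ the coefficients continue indefinitely, which is the case excluded by the convergence requirement at $x = 1$.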

This relation is different from that deduced for the rotator with fixed
axis, which was stated in equation (14), and is actually in much better
agreement with the spectroscopic observations on the rotational energy
levels of molecules than the latter, which is identical with the relation
derived by means of classical mechanics.
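The difference between the two results shows up most clearly in the spacing of successive levels. The sketch below evaluates $E_m = m^2h^2/(8\pi^2I)$ for the fixed axis and $E_k = k(k+1)h^2/(8\pi^2I)$ for the free axis; the moment of inertia used is a hypothetical value chosen only for illustration, and the function names are ours.

```python
import math

PLANCK = 6.62607015e-34  # Planck constant h in J s

def fixed_axis_energy(m, inertia):
    """E_m = m^2 h^2 / (8 pi^2 I), rotator with fixed axis."""
    return m * m * PLANCK ** 2 / (8 * math.pi ** 2 * inertia)

def free_axis_energy(k, inertia):
    """E_k = k(k+1) h^2 / (8 pi^2 I), rotator with free axis."""
    return k * (k + 1) * PLANCK ** 2 / (8 * math.pi ** 2 * inertia)

INERTIA = 1.5e-46  # kg m^2, hypothetical value for illustration only
UNIT = PLANCK ** 2 / (8 * math.pi ** 2 * INERTIA)

for n in range(4):
    fixed_gap = (fixed_axis_energy(n + 1, INERTIA) - fixed_axis_energy(n, INERTIA)) / UNIT
    free_gap = (free_axis_energy(n + 1, INERTIA) - free_axis_energy(n, INERTIA)) / UNIT
    print(n, round(fixed_gap, 6), round(free_gap, 6))
```

In units of $h^2/8\pi^2I$ the fixed-axis spacings are the odd integers $2n + 1$, while the free-axis spacings are the even integers $2(n + 1)$.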

6.5 Legendre Equation of Order Zero. Substituting in equation
(21) the value for $\alpha^2$ deduced in equation (24), we obtain the differential
equation for $X_k$, the eigenfunction corresponding to $E_k$, in the form

$$(1 - x^2)X_k'' - 2x\,X_k' + k(k + 1)X_k = 0.$$
From (23) it follows that

$$a_k = a_{k-2}\,\frac{(k - 2)(k - 1) - k(k + 1)}{k(k - 1)} = -a_{k-2}\,\frac{2(2k - 1)}{k(k - 1)},$$

or

$$a_{k-2} = -a_k\,\frac{k(k - 1)}{2(2k - 1)}. \tag{26}$$

Similarly, it is readily shown that

$$a_{k-4} = -a_{k-2}\,\frac{(k - 2)(k - 3)}{4(2k - 3)} = a_k\,\frac{k(k - 1)(k - 2)(k - 3)}{2\cdot4\cdot(2k - 1)(2k - 3)}, \tag{27}$$

$$a_{k-6} = -a_k\,\frac{k(k - 1)(k - 2)(k - 3)(k - 4)(k - 5)}{2\cdot4\cdot6\cdot(2k - 1)(2k - 3)(2k - 5)}. \tag{28}$$

If $k$ is even, the power series beginning with $x^k$ will end with $a_0$; if $k$ is
odd, with $a_1x$. We thus obtain the series,

$$X_k = a_kx^k + a_{k-2}x^{k-2} + a_{k-4}x^{k-4} + \cdots, \tag{29}$$

where the coefficients of the various powers of $x$ are given by relations
similar to (28), and $a_k$ is arbitrary.
If we assign the value⁵

$$a_k = \frac{(2k - 1)(2k - 3)\cdots1}{k!},$$

the resulting function $X_k$ is known as a Legendre function of order zero
and degree $k$ or a surface zonal harmonic and designated by the symbol
$P_k(x) = P_k(\cos\theta)$. Thus the complete expression for the Legendre
function has the form

$$P_k(x) = \frac{(2k - 1)(2k - 3)\cdots1}{k!}\left[x^k - \frac{k(k - 1)}{2(2k - 1)}\,x^{k-2} + \frac{k(k - 1)(k - 2)(k - 3)}{2\cdot4\cdot(2k - 1)(2k - 3)}\,x^{k-4} - \cdots\right]. \tag{30}$$

5 This value, as will appear from (31), makes the first Legendre function $P_0(x) = 1$.
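The construction just described is easily mechanized. The following sketch starts from the leading coefficient $(2k-1)(2k-3)\cdots1/k!$ and steps downward two powers at a time with the recursion, using exact fractions; the function names are our own.

```python
from fractions import Fraction
from math import factorial

def odd_product(k):
    """(2k-1)(2k-3)...1, with the empty product equal to 1 for k = 0."""
    result = 1
    for j in range(1, 2 * k, 2):
        result *= j
    return result

def legendre_coefficients(k):
    """Return [c_0, ..., c_k] such that P_k(x) = sum of c_j x^j."""
    coeffs = [Fraction(0)] * (k + 1)
    coeffs[k] = Fraction(odd_product(k), factorial(k))
    j = k
    while j >= 2:
        # a_{j-2} = -a_j j(j-1) / {k(k+1) - (j-2)(j-1)}; for j = k this is relation (26).
        coeffs[j - 2] = -coeffs[j] * j * (j - 1) / (k * (k + 1) - (j - 2) * (j - 1))
        j -= 2
    return coeffs

for k in range(4):
    print(k, legendre_coefficients(k))
```

The coefficients for $k = 2$ and $k = 3$ come out as $-\tfrac12, 0, \tfrac32$ and $0, -\tfrac32, 0, \tfrac52$, and in every case the coefficients sum to unity, so that $P_k(1) = 1$.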
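The first few members of this series are as follows:⁶

$$\begin{aligned}
P_0(x) &= 1 &&\text{or}\quad P_0(\cos\theta) = 1,\\
P_1(x) &= x &&\text{or}\quad P_1(\cos\theta) = \cos\theta,\\
P_2(x) &= \tfrac{1}{2}(3x^2 - 1) &&\text{or}\quad P_2(\cos\theta) = \tfrac{1}{2}(3\cos^2\theta - 1),\\
P_3(x) &= \tfrac{1}{2}(5x^3 - 3x) &&\text{or}\quad P_3(\cos\theta) = \tfrac{1}{2}(5\cos^3\theta - 3\cos\theta).
\end{aligned} \tag{31}$$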



FIG. 26. Plots of the first four Legendre polynomials as functions of $\theta$.

FIG. 27. Plots of Legendre polynomials ..., inclusive.

At $\theta = 0$, $\cos\theta = x = 1$, while at $\theta = \pi$, $\cos\theta = -1$, and at $\theta = \pi/2$,
$\cos\theta = x = 0$. The functions may be plotted as functions of either $\theta$
or $x$. In the former case, the limits are 0 and $\pi$; in the latter, the
corresponding limits are 1 and $-1$. Figures 26 and 27 (taken from the
curves plotted in Byerly's treatise) show graphs of the functions $P_1(\cos\theta)$
to $P_7(\cos\theta)$, inclusive, as functions of $\theta$. It will be observed that at the
limits all the functions have the identical absolute value 1. Furthermore,
since

$$P_k(-x) = (-1)^kP_k(x),$$

it follows that for $k$ even, $P_k(-x) = P_k(x)$, that is, the function is
symmetrical about $x = 0$ or $\theta = \pi/2$, resembling in this respect $\sin\theta$,
while for $k$ odd, $P_k(-x) = -P_k(x)$, and the functions change sign in
passing through $x = 0$ or $\theta = \pi/2$, which also is characteristic of $\cos\theta$.

6 Tables of values of $P_1$ to $P_7$ (inclusive), as functions of $\theta$, are given in L. Silberstein's
"Mathematical Tables" and in an appendix in Byerly's "Fourier's Series."
As pointed out in connection with the Hermite polynomials, the
number of roots, corresponding to nodes, along the axis of $x$ or $\theta$, is equal
to that of the highest power of $x$. Thus $P_1(x) = \cos\theta$ passes through
0 at $\theta = \pi/2$, while $P_2(x)$ exhibits two nodes which may be determined
from the quadratic relation

$$P_2(x) = \tfrac{1}{2}\left(x\sqrt{3} + 1\right)\left(x\sqrt{3} - 1\right) = 0.$$

Hence $P_2(x) = 0$ for $\cos\theta = \pm1/\sqrt{3} = \pm0.5774$, that is, for $\theta_1$ =
54° 44′ and $\theta_2$ = 125° 16′. In a similar manner it is possible to calculate
the $k$ values of $\theta$ at which any given function $P_k(\cos\theta)$ becomes
equal to zero.

As for the other polynomials, it is readily shown in any of the treatises
on this topic that the Legendre polynomials are related by a recursion
formula of the form

$$(k + 1)P_{k+1}(x) = (2k + 1)\,x\,P_k(x) - k\,P_{k-1}(x),$$

which makes it possible to calculate higher members of the series from
the lower members.
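The recursion formula, $(k+1)P_{k+1}(x) = (2k+1)xP_k(x) - kP_{k-1}(x)$ with $P_0 = 1$ and $P_1 = x$, lends itself to direct computation, and the nodes of $P_k(\cos\theta)$ can then be located numerically. The sketch below is one way of doing so; the sampling grid, the bisection count, and the function names are arbitrary choices of ours.

```python
import math

def legendre(k, x):
    """Evaluate P_k(x) by stepping the recursion formula upward from P_0 and P_1."""
    if k == 0:
        return 1.0
    prev, curr = 1.0, x
    for j in range(1, k):
        prev, curr = curr, ((2 * j + 1) * x * curr - j * prev) / (j + 1)
    return curr

def nodes_in_degrees(k, samples=2000):
    """Zeros of P_k(cos theta) for 0 < theta < 180 degrees, by sign change and bisection."""
    def f(theta):
        return legendre(k, math.cos(theta))

    roots = []
    grid = [math.pi * (i + 0.5) / samples for i in range(samples)]
    for a, b in zip(grid, grid[1:]):
        fa, fb = f(a), f(b)
        if fa * fb < 0:
            for _ in range(60):
                mid = 0.5 * (a + b)
                if fa * f(mid) <= 0:
                    b = mid
                else:
                    a, fa = mid, f(mid)
            roots.append(math.degrees(0.5 * (a + b)))
    return roots

print(nodes_in_degrees(2))
print(len(nodes_in_degrees(7)))
```

For $k = 2$ the two nodes fall near 54.74° and 125.26°, and for $k = 7$ seven nodes are found, in accord with the rule that the number of nodes equals the degree.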
From equation (30) it follows that

$$\int_0^x P_k(x)\,dx = \frac{(2k - 1)(2k - 3)\cdots1}{(k + 1)!}\left[x^{k+1} - \frac{(k + 1)k}{2(2k - 1)}\,x^{k-1} + \frac{(k + 1)k(k - 1)(k - 2)}{2\cdot4\cdot(2k - 1)(2k - 3)}\,x^{k-3} - \cdots\right],$$

and if this integration is repeated $k - 1$ times more, the result is

$$\frac{(2k - 1)(2k - 3)\cdots1}{(2k)!}\left[x^{2k} - \frac{2k(2k - 1)}{2(2k - 1)}\,x^{2k-2} + \frac{2k(2k - 1)(2k - 2)(2k - 3)}{2\cdot4\cdot(2k - 1)(2k - 3)}\,x^{2k-4} - \cdots\right] = \frac{(x^2 - 1)^k}{(2k)(2k - 2)\cdots2}, \tag{32}$$

as may be demonstrated by expanding the expression $(x^2 - 1)^k$. The
$k$th derivative of the function on the right-hand side of the last equation
is therefore $P_k(x)$, so that

$$P_k(x) = \frac{1}{2^k\,k!}\,\frac{d^k}{dx^k}(x^2 - 1)^k. \tag{33}$$

This is known as Rodrigues' Formula.
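As a brief worked illustration of Rodrigues' formula, $P_k(x) = \dfrac{1}{2^k k!}\dfrac{d^k}{dx^k}(x^2-1)^k$, the case $k = 2$ may be carried through explicitly (the choice of $k$ is ours):

```latex
\begin{align*}
P_2(x) &= \frac{1}{2^2\cdot 2!}\,\frac{d^2}{dx^2}\,(x^2 - 1)^2
        = \frac{1}{8}\,\frac{d^2}{dx^2}\left(x^4 - 2x^2 + 1\right) \\
       &= \frac{1}{8}\left(12x^2 - 4\right)
        = \tfrac{1}{2}\left(3x^2 - 1\right).
\end{align*}
```

This agrees with the expression for $P_2(x)$ obtained from the series.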

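It may be shown by means of this formula that the Legendre polynomials
form an orthogonal system, since

$$\int_{-1}^{1}P_k(x)P_n(x)\,dx = \begin{cases} 0 & (\text{for } k \neq n), \\[4pt] \dfrac{2}{2k + 1} & (\text{for } k = n). \end{cases} \tag{34}$$

Hence, the normalizing factor is

$$N_k = \sqrt{\frac{2}{2k + 1}}. \tag{35}$$

It follows from these relations that if $n + k$ is even and $n \neq k$,

$$\int_0^1P_k(x)P_n(x)\,dx = \tfrac{1}{2}\int_{-1}^{1}P_k(x)P_n(x)\,dx = 0,$$

and also that

$$\int_0^1\left[P_k(x)\right]^2dx = \frac{1}{2k + 1}.$$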

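The orthogonality relation (34) may also be derived by the following
proof which is independent of Rodrigues' formula, and is of very general
application in spherical harmonics.

Since $P_k(x)$ and $P_n(x)$ satisfy Legendre's equation,

$$\frac{d}{dx}\left\{(1 - x^2)\frac{dP_k}{dx}\right\} + k(k + 1)P_k = 0, \tag{i}$$

and

$$\frac{d}{dx}\left\{(1 - x^2)\frac{dP_n}{dx}\right\} + n(n + 1)P_n = 0. \tag{ii}$$

Multiplying (i) by $P_n$ and (ii) by $P_k$, subtracting, and then integrating,
we obtain the relation

$$\left[n(n + 1) - k(k + 1)\right]\int_{-1}^{1}P_nP_k\,dx = \int_{-1}^{1}\left\{P_n\frac{d}{dx}\left[(1 - x^2)\frac{dP_k}{dx}\right] - P_k\frac{d}{dx}\left[(1 - x^2)\frac{dP_n}{dx}\right]\right\}dx.$$

Integrating the right-hand side by parts, it becomes

$$\left[(1 - x^2)\left(P_n\frac{dP_k}{dx} - P_k\frac{dP_n}{dx}\right)\right]_{-1}^{1} - \int_{-1}^{1}(1 - x^2)\frac{dP_n}{dx}\frac{dP_k}{dx}\,dx + \int_{-1}^{1}(1 - x^2)\frac{dP_k}{dx}\frac{dP_n}{dx}\,dx,$$

which is equal to zero, since $1 - x^2 = 0$ at $x = \pm1$, while the two integrals
cancel. Consequently, if $n \neq k$,

$$\int_{-1}^{1}P_n(x)P_k(x)\,dx = 0.$$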
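The orthogonality and normalization integrals can be verified exactly for the lower polynomials. In the sketch below the coefficient lists are built from the recursion formula $(j+1)P_{j+1} = (2j+1)xP_j - jP_{j-1}$ and the products are integrated term by term with exact fractions; the function names are our own.

```python
from fractions import Fraction

def legendre_poly(k):
    """Coefficient list of P_k(x), lowest power first, from the recursion formula."""
    prev, curr = [Fraction(1)], [Fraction(0), Fraction(1)]
    if k == 0:
        return prev
    for j in range(1, k):
        shifted = [Fraction(0)] + curr                        # x * P_j
        padded = prev + [Fraction(0)] * (len(shifted) - len(prev))
        nxt = [((2 * j + 1) * s - j * p) / (j + 1) for s, p in zip(shifted, padded)]
        prev, curr = curr, nxt
    return curr

def integrate_product(p, q):
    """Exact integral of p(x) * q(x) from -1 to +1."""
    total = Fraction(0)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            n = i + j
            if n % 2 == 0:                                    # odd powers integrate to zero
                total += a * b * Fraction(2, n + 1)
    return total

for k in range(4):
    print([integrate_product(legendre_poly(k), legendre_poly(n)) for n in range(4)])
```

The printed array is diagonal, with the diagonal entries equal to $2/(2k+1)$, that is, 2, 2/3, 2/5, and 2/7.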

6.6 Associated Legendre Functions. We shall now consider equation
(20) for the case $m \ge 1$. In the preceding discussion this equation
was solved for the case $m = 0$, and it was found that the equation gives
solutions which are physically significant if we put

$$\alpha^2 = k(k + 1). \tag{24}$$

If we substitute for $\alpha^2$ in equation (20) we obtain the relation

$$\frac{d}{dx}\left\{(1 - x^2)\frac{dX}{dx}\right\} + \left\{k(k + 1) - \frac{m^2}{1 - x^2}\right\}X = 0. \tag{36}$$

Let us introduce a new function $Y$ defined by the relation

$$X = (1 - x^2)^{m/2}\,Y. \tag{37}$$

Then

$$\frac{dX}{dx} = -\frac{mx(1 - x^2)^{m/2}}{1 - x^2}\,Y + (1 - x^2)^{m/2}\,\frac{dY}{dx},$$

$$(1 - x^2)\frac{dX}{dx} = -mx(1 - x^2)^{m/2}\,Y + (1 - x^2)^{m/2+1}\,\frac{dY}{dx},$$

and

$$\frac{d}{dx}\left\{(1 - x^2)\frac{dX}{dx}\right\} = (1 - x^2)^{m/2}\left\{(1 - x^2)\frac{d^2Y}{dx^2} - 2(m + 1)x\,\frac{dY}{dx} - mY + \frac{m^2x^2}{1 - x^2}\,Y\right\}.$$

Substituting these relations into (36) and dividing by $(1 - x^2)^{m/2}$,
which is not zero, except at the limits, we obtain the relation

$$(1 - x^2)\frac{d^2Y}{dx^2} - 2(m + 1)x\,\frac{dY}{dx} + (k - m)(k + m + 1)\,Y = 0. \tag{38}$$

With this differential equation we shall compare the differential equation
(21) which can evidently be written in the form

$$(1 - x^2)\frac{d^2P_k}{dx^2} - 2x\,\frac{dP_k}{dx} + k(k + 1)P_k = 0.$$

If we now differentiate this $m$ times, it can be shown that the resulting
equation is identical with (38), and consequently,

$$Y = \frac{d^mP_k(x)}{dx^m}. \tag{39}$$
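The statement may be checked directly for the simplest case, $m = 1$. Differentiating the Legendre equation of order zero once with respect to $x$ and writing $Y = dP_k/dx$ (a worked step supplied here for illustration), we find

```latex
\begin{align*}
\frac{d}{dx}\left[(1 - x^2)P_k'' - 2xP_k' + k(k + 1)P_k\right]
  &= (1 - x^2)P_k''' - 2xP_k'' - 2xP_k'' - 2P_k' + k(k + 1)P_k' \\
  &= (1 - x^2)Y'' - 4xY' + \left[k(k + 1) - 2\right]Y \\
  &= (1 - x^2)Y'' - 2(1 + 1)\,xY' + (k - 1)(k + 1 + 1)\,Y = 0 .
\end{align*}
```

Since $(k - 1)(k + 2) = k(k + 1) - 2$, this is the equation for $Y$ with $m = 1$, namely $(1 - x^2)Y'' - 2(m + 1)xY' + (k - m)(k + m + 1)Y = 0$.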



The function $X$ thus obtained, see equation (37), is known as the
associated Legendre Function of the first kind, of degree $k$ and order $m$,
and is designated thus

$$P_k^m(x) = (1 - x^2)^{m/2}\,\frac{d^mP_k(x)}{dx^m}. \tag{40}$$

Since the differential coefficient becomes zero for $m > k$, it follows
that $m$ can have only the series of integral values $m$ = 0, 1, 2, . . . $k$.
Thus corresponding to any value of $k$, there are ($k + 1$) Legendre functions
which satisfy equation (20) and also ($2k + 1$) functions $Z_m$,
which satisfy equation (17), corresponding to $m$ = 0, ±1, ±2, . . . ±$k$.

Since

$$(1 - x^2)^{m/2} = \sin^m\theta,$$

the associated Legendre function may also be written in the form⁷

$$P_k^m(\cos\theta) = \sin^m\theta\,\frac{d^mP_k(\cos\theta)}{d(\cos\theta)^m}. \tag{41}$$

7 The notation given is that used by Byerly, MacRobert, Pauling and Wilson, and 
most of the authorities. On the other hand, Courant and Hilbert, as well as Condon 
and Morse, use the notation 



and 




The functions

$$\cos m\eta\,\sin^m\theta\,\frac{d^mP_k(x)}{dx^m} \quad\text{and}\quad \sin m\eta\,\sin^m\theta\,\frac{d^mP_k(x)}{dx^m} \tag{42}$$

are known as tesseral harmonics of the $k$th degree and $m$th order. In
terms of exponentials, the functions are

$$e^{im\eta}P_k^m(x) \quad\text{and}\quad e^{-im\eta}P_k^m(x), \tag{43}$$

and as stated already, for any given value of $k$, there are ($2k + 1$)
functions which satisfy the differential equations (17) and (18). It will be
observed that the only condition attached to $m$ in the solution of (17)
is that it must have an integral value (including 0). The condition
that $m$ cannot exceed $k$ was derived from the subsequent deduction
based on the fact that the eigenfunctions which satisfy equation (18)
are of the type $P_k^m(x)$.

It was also deduced that the eigenvalues corresponding to the different
energy states are given by

$$E_k = \frac{k(k + 1)\,h^2}{8\pi^2I}. \tag{25}$$

Thus it follows that for any given energy state, corresponding to a 
given value of k, there are (2k + 1) eigenfunctions. In the case of the 
rotator with fixed axis, it was found that for each energy state 
there are two possible eigenfunctions. The order of degeneracy is 
therefore two in that case. But in the case of the rotator with free 
axis we find that the order of degeneracy is 2k + 1. The physical inter- 
pretation is similar to that given in the previous case. As Condon and 
Morse describe it: " this is the degeneracy of random space orientation 
in a centrally symmetric field, and gives the multiplicity into which 
the terms are split when a non-symmetric perturbing field removes the 
degeneracy." Thus in the presence of magnetic or electrostatic fields 
this degeneracy may be completely removed, because such fields intro- 
duce perturbing effects. 

The functions $P_k^m(x)$ satisfy the condition for orthogonality of the
form

$$\int_{-1}^{1}P_k^m(x)\,P_j^m(x)\,dx = 0 \quad (j \neq k), \tag{44}$$

where obviously $m$ must not exceed either $j$ or $k$. This may be deduced
by an argument quite similar to that used for demonstrating the orthogonal
nature of the Legendre polynomials of zero order. That is, any
two Legendre polynomials of the same order and different degrees are
orthogonal to each other.
It may also be shown that

$$\int_{-1}^{1}\left[P_k^m(x)\right]^2dx = \frac{2}{2k + 1}\cdot\frac{(k + m)!}{(k - m)!}. \tag{45}$$

Hence the normalizing factor for the tesseral harmonic given in (42)
or (43) is given by⁸

$$N_k^m = \sqrt{\frac{2}{2k + 1}\cdot\frac{(k + m)!}{(k - m)!}}. \tag{46}$$

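Finally, it is of interest to give the expressions for some of the associated
Legendre functions corresponding to the expressions for the
functions of zero order which were given in equations (31). The first
ten of these polynomials (including the functions of zero order) are as
follows:

$$\begin{aligned}
&P_0(x) = 1;\\
&P_1(x) = x; \qquad P_1^1(x) = (1 - x^2)^{1/2};\\
&P_2(x) = \tfrac{1}{2}(3x^2 - 1); \qquad P_2^1(x) = (1 - x^2)^{1/2}\cdot3x; \qquad P_2^2(x) = (1 - x^2)\cdot3;\\
&P_3(x) = \tfrac{1}{2}(5x^3 - 3x); \qquad P_3^1(x) = (1 - x^2)^{1/2}\cdot\tfrac{3}{2}(5x^2 - 1);\\
&P_3^2(x) = (1 - x^2)\cdot15x; \qquad P_3^3(x) = (1 - x^2)^{3/2}\cdot15,
\end{aligned} \tag{47}$$

where $x = \cos\theta$.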
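The polynomial factors appearing in these expressions can be generated mechanically by combining Rodrigues' formula with the definition of the associated function, that is, by differentiating $(x^2-1)^k$ a total of $k + m$ times and dividing by $2^k\,k!$. The sketch below does this with exact fractions; the function names are our own, and the factor $(1-x^2)^{m/2}$ is left to be supplied separately.

```python
from fractions import Fraction
from math import comb, factorial

def derivative(coeffs, times):
    """Differentiate a polynomial (coefficients listed lowest power first) repeatedly."""
    for _ in range(times):
        coeffs = [j * c for j, c in enumerate(coeffs)][1:]
    return coeffs

def associated_factor(k, m):
    """Coefficients of d^m P_k / dx^m; P_k^m(x) is this polynomial times (1 - x^2)^(m/2)."""
    # Binomial expansion: (x^2 - 1)^k = sum over j of C(k, j) (-1)^(k-j) x^(2j)
    expanded = [Fraction(0)] * (2 * k + 1)
    for j in range(k + 1):
        expanded[2 * j] = Fraction(comb(k, j) * (-1) ** (k - j))
    scale = Fraction(1, 2 ** k * factorial(k))
    return [scale * c for c in derivative(expanded, k + m)]

for m in range(5):
    print(m, associated_factor(3, m))
```

For $k = 3$ the output corresponds to $\tfrac12(5x^3 - 3x)$, $\tfrac32(5x^2 - 1)$, $15x$, and $15$ for $m$ = 0, 1, 2, 3, and to an empty list (a vanishing derivative) for $m = 4 > k$.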

Since $d^kP_k(x)/dx^k = C$, a constant, it follows that the functions
$P_k^k(x)$, for which the order and degree are identical, possess no nodes,
but pass through a maximum at $x = 0$ (i.e., $\theta = \pi/2$) as is evident from
the relation

$$P_k^k(\cos\theta) = C\cdot\sin^k\theta. \tag{48}$$



8 It is becoming customary in treatises on quantum mechanics to designate the 
square of the expressions on the right-hand side of (35) and (46) or the integrals in 
(34) and (45) by N. For instance, H. Bethe, in " Handbuch der Physik," Vol. XXIV, 
Part 1, uses this notation, which may give rise to some confusion for the reader who 
consults that discussion. 
Figure 28⁹ shows plots of the four normalized functions $P_3^0(\cos\theta)$,
$P_3^1(\cos\theta)$, $P_3^2(\cos\theta)$, and $P_3^3(\cos\theta)$ as functions of $\theta$. The normalization
factors, as deduced from equation (46), are $N$ = $\sqrt{2/7}$, $\sqrt{24/7}$,
$\sqrt{240/7}$, and $\sqrt{1440/7}$, respectively. The actual relations for the
four normalized functions, and the designations for the corresponding
curves, are

$$\begin{aligned}
P_{03} &= P_3^0(\cos\theta) = 0.936\,(5\cos^3\theta - 3\cos\theta),\\
P_{13} &= P_3^1(\cos\theta) = 0.810\,\sin\theta\,(5\cos^2\theta - 1),\\
P_{23} &= P_3^2(\cos\theta) = 2.563\,\sin^2\theta\cos\theta,\\
P_{33} &= P_3^3(\cos\theta) = 1.046\,\sin^3\theta.
\end{aligned}$$

FIG. 28. Plots of the normalized Legendre functions $P_3^m(\cos\theta)$ (abscissa: $\theta$ in degrees, 0 to 180).

The nodal points were obtained by solving the equations $P_3^m(\cos\theta) = 0$;
the points of maximum values were obtained by solving the equation

$$\frac{dP_3^m(\cos\theta)}{d\theta} = 0,$$

where $m$ = 0, 1, 2, 3.
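The numerical coefficients quoted for the normalized functions of degree 3 can be reproduced in a few lines. The sketch below assumes that the normalizing factor is the square root of the integral of $[P_k^m(x)]^2$ from $-1$ to $+1$, namely $N = \sqrt{\tfrac{2}{2k+1}\cdot\tfrac{(k+m)!}{(k-m)!}}$, which gives $\sqrt{2/7}$, $\sqrt{24/7}$, $\sqrt{240/7}$, and $\sqrt{1440/7}$ for $k = 3$; the function and variable names are ours.

```python
import math

def normalization(k, m):
    """N = sqrt( 2/(2k+1) * (k+m)!/(k-m)! ), an assumed form consistent with the quoted values."""
    return math.sqrt(2 / (2 * k + 1) * math.factorial(k + m) / math.factorial(k - m))

# Leading numerical factors of P_3^m written as 1/2 (5x^3 - 3x), 3/2 (5x^2 - 1) sin,
# 15 x sin^2, and 15 sin^3, for m = 0, 1, 2, 3.
prefactors = {0: 0.5, 1: 1.5, 2: 15.0, 3: 15.0}

for m, c in prefactors.items():
    print(m, round(normalization(3, m) ** 2 * 7, 6), round(c / normalization(3, m), 3))
```

The second column reproduces the numerators 2, 24, 240, and 1440, and the third agrees with the coefficients 0.936, 0.810, 2.563, and 1.046 to within about one unit in the third decimal place.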
6.7 Geometrical Interpretation of Surface Spherical Harmonics.

Let us consider now the geometrical interpretation of the Legendre
functions and tesseral harmonics which have been discussed in the
previous sections.
The function

$$(A\cos m\eta + B\sin m\eta)\,P_k^m(\cos\theta) \tag{49}$$

represents a surface spherical harmonic of $k$th degree and $m$th order.
If $m = 0$, the function has the form $P_k(\cos\theta)$, which is Legendre's
coefficient of the first kind, of degree $k$. This function is a polynomial of
degree $k$ and therefore has $k$ distinct zero points between $\cos\theta = -1$
and $\cos\theta = +1$. As shown in the curves in Fig. 26 and Fig. 27,
these "nodes" are arranged symmetrically about $\cos\theta = 0$, i.e.,
$\theta = \pi/2$. Hence, on a sphere with the origin as center, the function
$P_k(\cos\theta)$ becomes 0 on $k$ different circles, which, as shown in Fig. 29,
correspond to different degrees of "latitude," that is, they possess poles
at $\theta = 0$ and $\theta = \pi$. These circles are symmetrical with respect to the
"equatorial" circle, and if $k$ is odd, the latter is one of the set of circles
for which $P_k(\cos\theta) = 0$. Furthermore, as shown by the plots of the
functions, since the value of any function $P_k(\cos\theta)$ exhibits $k - 1$
loops, there are $2(k - 1)$ circles parallel to the nodal circles at which
the function has the same absolute value. It is for this reason that
the Legendre coefficients of zero order are known as zonal harmonics.
The point $\theta = 0$ is designated the pole, and the diameter through the
pole, the axis of the zonal harmonic.

9 Condon and Morse, "Quantum Mechanics," p. 55.

FIG. 29. Geometrical illustration of zonal harmonics.

FIG. 30. Geometrical illustration of tesseral harmonics.

For $m$ greater than 0 and less than $k$, the functions are represented
by the expression

$$(A\cos m\eta + B\sin m\eta)\,\sin^m\theta\,\frac{d^mP_k(\cos\theta)}{d(\cos\theta)^m}.$$

This may be written in the form

$$\sqrt{A^2 + B^2}\,\sin(m\eta + \delta)\,\sin^m\theta\,\frac{d^mP_k(\cos\theta)}{d(\cos\theta)^m}, \tag{50}$$

where $\tan\delta = A/B$. It vanishes for $m\eta = -\delta$ and $m\eta = \pi - \delta$. On
a sphere, as shown in Fig. 30, this corresponds to $m$ great circles through
the pole $\theta = 0$ (circles of "longitude"), distributed symmetrically, so
that the angle between the planes of any two consecutive circles is
equal to $\pi/m$.
The factor $\sin^m\theta$ is equal to zero only at $\theta = 0$ and $\theta = \pi$. The differential
coefficient $d^mP_k(x)/dx^m$ is represented by a function which is
the $m$th derivative of a polynomial of degree $k$. Thus the highest
power of $x$ has the value $k - m$, and the function has $k - m$ zeros on
circles with $\theta = 0$ as pole, which are arranged like the corresponding
circles in the case of the zonal harmonics. Since the two sets of circles
intersect orthogonally, these harmonics are designated tesseral harmonics.
(See Fig. 30.)

The sum of the $2k + 1$ tesseral harmonics, for which the general expression
is given by (49) or (50), is known as a surface spherical harmonic
of degree $k$.

For $m = k$, the differential coefficient becomes a constant factor,
and the spherical harmonic is of the form

$$\sqrt{A^2 + B^2}\cdot\sin(k\eta + \delta)\,\sin^k\theta. \tag{51}$$

As pointed out already, this vanishes on $k$ great circles passing through
the points $\theta = 0$ and $\theta = \pi$, the angle between
the planes of any two consecutive circles being
$\pi/k$. Since the sphere is thus divided up into $2k$
sectors, as shown in Fig. 31, these functions are
known as sectorial harmonics.

For any given value of $\eta$, the expression in (51)
is evidently of the same form as that in (48), and
it is seen that, as the value of $k$ is increased, the
function tends to assume appreciable values in an
increasingly narrower region symmetrical about
the equatorial plane. The interpretation of this
result from a physical point of view is considered in the following
section.
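The narrowing of the sectorial harmonics about the equatorial plane can be put in numbers. As a rough measure we take the angular range about $\theta = 90°$ within which $\sin^k\theta$ exceeds one-half of its maximum value; this criterion, and the function name, are our own choices made only for illustration.

```python
import math

def half_maximum_width_deg(k):
    """Width in degrees of the band about theta = 90 deg in which sin^k(theta) >= 1/2."""
    theta_half = math.asin(0.5 ** (1.0 / k))   # angle at which sin^k(theta) = 1/2
    return 180.0 - 2.0 * math.degrees(theta_half)

for k in (1, 4, 16, 64):
    print(k, round(half_maximum_width_deg(k), 1))
```

The width decreases steadily as $k$ is increased, from 120° for $k = 1$ to a band of a few tens of degrees for the larger values of $k$.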

6.8 The Physical Significance of the Characteristic Functions. We 
may now consider the significance of the somewhat tedious calcula- 
tions and seemingly complicated results that have been derived in the 
previous sections. 

The problem to be solved is the following. Given a diatomic molecule, 
what will be the possible energy states and modes of rotation for such 
a molecule? The problem first originated because of the observa- 
tions on the temperature variation of the specific heat of diatomic 
gases. In order to account for the increase in specific heat with tem- 
perature, it was found necessary to assume that, in addition to kinetic 
energy of translational motion, diatomic molecules also possess an 
energy due to rotation about an axis of symmetry passing through the 
center of gravity. 




FIG. 31. Geometrical illustration of sectorial harmonics.
On the basis of the classical quantum theory, the method of determining
the discrete energy states of such a system was as follows:
Referring to equation (5), the total energy is given by

$$E = \frac{1}{2\mu}\left(p_r^2 + \frac{p_\theta^2}{r^2} + \frac{p_\eta^2}{r^2\sin^2\theta}\right). \tag{52}$$

In the case of the rigid rotator, $p_r = 0$, and $\mu r^2 = I$, the moment of
inertia. Hence, the equation for the rotator with free axis is

$$E = \frac{1}{2I}\left(p_\theta^2 + \frac{p_\eta^2}{\sin^2\theta}\right).$$

Since the evaluation of the integral $\oint p_\theta\,d\theta$ is rather involved,¹⁰ we
shall consider only the case of rotation in a plane. For this case,
$p_\theta = 0$ and $\sin^2\theta = 1$. Hence,

$$E = \frac{p_\eta^2}{2I}. \tag{i}$$

From the canonical relation

$$\frac{dp_\eta}{dt} = -\frac{\partial H}{\partial\eta} = 0,$$

it follows that

$$p_\eta = a, \text{ a constant} \tag{ii}$$

$$\phantom{p_\eta} = \sqrt{2IE} \text{ from (i)}.$$

Therefore,

$$\oint p_\eta\,d\eta = \int_0^{2\pi}\sqrt{2IE}\;d\eta = 2\pi\sqrt{2IE}.$$

Since, in accordance with the Wilson-Sommerfeld quantum condition,

$$2\pi\sqrt{2IE} = mh,$$

where $m$ = 0, 1, 2, 3, etc., it follows that

$$E_m = \frac{m^2h^2}{8\pi^2I}.$$

In the case of the rotator with free axis, the calculation leads to the
same result, where $m$ is the sum of the two quantum numbers, one
corresponding to $p_\eta$ and the other to $p_\theta$.

10 Full details will be found in M. Born's "Atommechanik" and Sommerfeld's
"Atombau und Spektrallinien."
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But observations on band spectra, in which the lines constituting 
individual bands are due to transitions between states differing in 
amounts of rotational energy, showed that this result was not quite 
satisfactory. On the basis of the S. equation, as shown in equation (14), 
the same result is deduced, if it is assumed that the molecule is capable 
of rotation in only one plane. The result deduced in equation (25) is, 
however, in very good agreement with observations on band spectra. 
Hence, we conclude that diatomic molecules possess two modes of 
motion about their center of gravity, one in which there is a rotation in 
the plane containing the axis of the molecule, about an axis of symmetry 
at right angles to this plane, and another which corresponds to a pre- 
cession of the axis of the molecule about the axis of symmetry. 
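
The difference between the two results can be made concrete with a short sketch. It assumes that equation (25), which is not reproduced in this section, has the form $E = k(k+1)h^2/(8\pi^2 I)$; the function names are illustrative only. Energies are expressed in units of $h^2/(8\pi^2 I)$, so the plane rotator gives $m^2$ and the rotator with free axis gives $k(k+1)$.

```python
def planar_levels(m_max):
    """Energies m^2 of the plane rotator, in units of h^2 / (8 pi^2 I)."""
    return [m * m for m in range(m_max + 1)]


def free_axis_levels(k_max):
    """Energies k(k+1), the form assumed here for equation (25), same units."""
    return [k * (k + 1) for k in range(k_max + 1)]


def line_positions(levels):
    """Energy differences between adjacent levels (quantum number changes by 1)."""
    return [upper - lower for lower, upper in zip(levels, levels[1:])]


if __name__ == "__main__":
    print("plane rotator lines:", line_positions(planar_levels(4)))
    print("free-axis lines:    ", line_positions(free_axis_levels(4)))
```

Under these assumptions the plane rotator predicts lines at odd multiples of the unit, while the free-axis rotator predicts them at even multiples, which is the kind of distinction that band-spectrum measurements can decide.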

Now it is the essence of the S. equation that it starts with this physical 
model and then, instead of discussing the consequences to be deduced 
from this model by classical mechanics, considers a partial differential 
equation which is derived from the physical model by a definite mathe- 
matical procedure, and which represents mathematically the propaga- 
tion of a wave motion. We abandon, as it were, the concrete, tangible 
model of a dumbbell-shaped mass rotating about an axis and con- 
sider instead the nature and properties of certain wave patterns ob- 
tained by solving the partial differential equation. 

In the previous sections it was found that corresponding to each
energy state of quantum number k, as defined by equation (25), there
are 2k + 1 characteristic functions which represent 2k + 1 different
possible modes of vibration. The next question to be considered is
this: What is the physical interpretation of these functions, which we
recognized as Legendre polynomials of the first kind?

As in the case of the characteristic functions $\phi$ ("amplitude"
functions), deduced in solving the problem of the linear harmonic
oscillator, we assign physical interpretations to $\phi\bar{\phi}$ or $\phi^2$. In the case of the
rigid rotator in three dimensions, the function $\phi$ is defined by the relation

    $\phi = \frac{1}{N} P_k^m(\cos\theta) \cdot \frac{1}{\sqrt{2\pi}}\, e^{\pm i m \eta}$,

where N is the normalizing factor for the particular Legendre
polynomial. This is determined from the relation

    $\frac{1}{N^2} \int_0^{\pi} \{P_k^m(\cos\theta)\}^2 \sin\theta\, d\theta = 1$.    (54)

Hence $(1/N)^2 \{P_k^m(\cos\theta)\}^2 \sin\theta\, d\theta$ is regarded as representing the
probability of locating the particle in the region on the surface of a sphere
lying between the zonal circles $\theta$ and $\theta + d\theta$, while
$\frac{1}{2\pi} e^{im\eta} e^{-im\eta}\, d\eta = \frac{1}{2\pi}\, d\eta$ represents the probability of occurrence in the meridian section
located between the angles $\eta$ and $\eta + d\eta$. Evidently this probability
is independent of $\eta$.

From Figs. 22a and 22b it follows that the element of area dA on
the surface of a sphere of unit radius is given by

    $dA = \sin\theta\, d\theta\, d\eta$,

and the probability of locating the particle in this area at the angles
$\theta$ and $\eta$ is

    $P\, dA = \frac{1}{2\pi N^2} \{P_k^m(\cos\theta)\}^2 \sin\theta\, d\theta\, d\eta$.    (55)

Hence,

    $P = \frac{1}{2\pi N^2} \{P_k^m(\cos\theta)\}^2$    (56)

corresponds to a probability per unit area or "probability density."¹¹

FIG. 32. Plots of the zonal distribution functions corresponding to k = 3.

From values of the normalized polynomials as functions of $\theta$, such as
illustrated in the plots in Fig. 28, it is possible to calculate both P and
$P \sin\theta$, and results obtained in this manner are illustrated by the plots
shown in Figs. 32, 33, 34, and 35. The first two figures give values of
$P \sin\theta$. The designations on the curves and the corresponding
normalized functions are as follows:

11 The function P corresponds to the product $\Theta(\vartheta)\Phi(\varphi)$ given by Pauling and
Wilson, "Introduction to Quantum Mechanics," pp. 132-133.



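    $A_{03} = \frac{1}{N^2} \{P_3^0(\cos\theta)\}^2 \sin\theta$.
    $A_{13} = \frac{1}{N^2} \{P_3^1(\cos\theta)\}^2 \sin\theta$.
    $A_{23} = \frac{1}{N^2} \{P_3^2(\cos\theta)\}^2 \sin\theta$.
    $A_{33} = \frac{1}{N^2} \{P_3^3(\cos\theta)\}^2 \sin\theta$.

FIG. 33. Plots of the zonal distribution functions corresponding to k = 3.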

For comparison there has also been plotted, as curve $A_0$ in Fig. 33,
the function $\frac{1}{2} \sin\theta$.

Since

    $\int_0^{\pi} \sin\theta\, d\theta = 2$

and the total area under each of the curves shown in Fig. 32 and Fig. 33
is equal to 1, the average ordinate is given by $1/\pi = 0.318$. This has
been indicated by the straight line BB in the
two figures.
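
The normalization in equation (54) and the statement about the average ordinate can be checked numerically. The sketch below is illustrative only: it uses the standard closed forms of the associated Legendre functions for k = 3 (signs are immaterial because the functions are squared) and a simple Simpson rule; the function names are not from the text.

```python
import math


def p3(m, theta):
    """Associated Legendre function P_3^m(cos theta), m = 0..3 (sign convention ignored)."""
    c, s = math.cos(theta), math.sin(theta)
    table = {
        0: 0.5 * (5 * c**3 - 3 * c),
        1: 1.5 * (5 * c * c - 1) * s,
        2: 15 * c * s * s,
        3: 15 * s**3,
    }
    return table[m]


def simpson(f, a, b, n=2000):
    """Composite Simpson rule with n (even) sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def norm_squared(m):
    """N^2 from equation (54): integral of {P_3^m}^2 sin(theta) over 0..pi."""
    return simpson(lambda t: p3(m, t) ** 2 * math.sin(t), 0.0, math.pi)


def zonal_area(m):
    """Area under the normalized zonal distribution curve for k = 3."""
    n2 = norm_squared(m)
    return simpson(lambda t: p3(m, t) ** 2 * math.sin(t) / n2, 0.0, math.pi)


if __name__ == "__main__":
    for m in range(4):
        print(m, norm_squared(m), zonal_area(m) / math.pi)
```

Each normalized curve encloses unit area over a base of length $\pi$, so the mean ordinate is $1/\pi \approx 0.318$ for every m, in agreement with the line BB.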

If now we compare two zones of equal widths,
at the angles $\theta_1$ and $\theta_2$, it is evident that the
areas of the two zones will be $2\pi \sin\theta_1\, d\theta$ and
$2\pi \sin\theta_2\, d\theta$, where $d\theta$ is the width of each zone,
and the radius of the sphere is taken as unity.
Hence the relative values of the probability
density, as given by P in equations (55) and
(56) and plotted in Figs. 34 and 35, are quite
different from the values $P \sin\theta$ shown in Figs.
32 and 33. In the former, the distance from
the center to any point on the curve gives the
relative value of P at the corresponding value
of $\theta$. In Fig. 34 the function $\{P_3^0(\cos\theta)\}^2/N^2$
has been plotted and should be compared with
the curve $A_{03}$ in Fig. 32, while the plot in
Fig. 35 which corresponds to $\{P_3^3(\cos\theta)\}^2/N^2$
is to be compared with the curve $A_{33}$ in
Fig. 33.

In terms of the model of a diatomic molecule these plots indicate that
the axis of the molecule will tend to be oriented with respect to the
axis of symmetry in those directions for which P is a maximum. This
interpretation is most readily evident from the plot $A_{33}$ in Fig. 33
and the corresponding plot in Fig. 35.

FIG. 34. Probability density function corresponding to associated Legendre
function $P_3^0(\cos\theta)$.

FIG. 35. Probability density function corresponding to associated Legendre
function $P_3^3(\cos\theta)$.

In this case there is a relatively narrow region about the value $\theta = \pi/2$
for which P is a maximum. As the value of k is increased (keeping
m = k), the width of this region decreases rapidly. That is, in the
rotational states of higher energy content the molecule will tend more
and more to rotate about an axis of symmetry at right angles to the
axis of the molecule.
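
The narrowing of the equatorial region for m = k can be estimated with a small sketch. It relies on the standard fact that the sectorial function $P_k^k(\cos\theta)$ is proportional to $\sin^k\theta$, so that P varies as $\sin^{2k}\theta$; the measure of width chosen here (full width at half maximum) and the function name are illustrative choices, not the author's.

```python
import math


def half_maximum_width_deg(k):
    """Full width (degrees) of the region about theta = 90 deg where sin^(2k) >= 1/2."""
    # sin(theta)^(2k) = 1/2  ->  theta = asin(2^(-1/(2k)))
    theta_half = math.asin(0.5 ** (1.0 / (2 * k)))
    return math.degrees(math.pi - 2 * theta_half)


if __name__ == "__main__":
    for k in (1, 2, 3, 5, 10, 20):
        print(k, round(half_maximum_width_deg(k), 1))
```

For k = 1 the width is 90 degrees, and it shrinks steadily as k grows, which is the behavior described for the states of higher rotational energy.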

6.9 Angular Momentum for Motion of Rotator. In Chapter II,
it was shown that in case of motion along a coordinate x, the
corresponding momentum is obtained by solving the equation

    $p_x \phi = \frac{h}{2\pi i} \frac{\partial \phi}{\partial x}$.    (57)

If the result of performing the operation on the right-hand side of
this equation is of the form $a\phi$, where a is a constant, the conclusion is
drawn that this will be the value of the momentum observed in any
experiment arranged for this purpose when the particle is in the state
designated by the eigenfunction $\phi$.

In the case of angular momenta equation (57) is not applicable, as 
appears from the following consideration. 

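As shown in section 8, the Hamiltonian form of expression for the
energy E is given by

    $E = \frac{1}{2I}\left(p_\theta^2 + \frac{p_\eta^2}{\sin^2\theta}\right)$.

If the same rule were followed as that used for converting an
expression for E in terms of rectangular coordinates and their corresponding
momenta into a S. equation (see Chapter II), the resulting differential
equation would be of the form

    $-\frac{h^2}{8\pi^2 I}\left(\frac{\partial^2 \phi}{\partial \theta^2} + \frac{1}{\sin^2\theta} \frac{\partial^2 \phi}{\partial \eta^2}\right) = E\phi$,

that is,

    $\frac{\partial^2 \phi}{\partial \theta^2} + \frac{1}{\sin^2\theta} \frac{\partial^2 \phi}{\partial \eta^2} + \frac{8\pi^2 I E}{h^2}\, \phi = 0$,

which obviously is not identical with equation (11), and is not the
correct form of S. equation to represent the particular problem.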

We will now try to deduce a rule for calculating the angular momenta
with respect to $\theta$ and $\eta$ for a rotating body, on the basis of wave
mechanics. As a first step we consider the relation which exists according
to ordinary mechanics between the angular momentum with respect
to the z-axis (the axis from which $\theta$ is measured) and the linear momenta
$p_x$ and $p_y$, with respect to the x- and y-axes, respectively. In order to
simplify the calculations we shall assume that the motion occurs in the
XOY plane only (see Fig. 22a), so that $d\theta/dt = 0$ and $\theta = \pi/2$.
We then have the following relations:

    $x = r \cos\eta; \quad dx = -r \sin\eta\, d\eta + \cos\eta\, dr$;    (i)

    $y = r \sin\eta; \quad dy = r \cos\eta\, d\eta + \sin\eta\, dr$.    (ii)

Hence,

    $x\, dy - y\, dx = (x r \cos\eta + y r \sin\eta)\, d\eta + (x \sin\eta - y \cos\eta)\, dr = r^2\, d\eta$,    (iii)

since $x^2 + y^2 = r^2$, and the coefficient of dr is equal to zero.
But

    $M_z = \mu r^2 \frac{d\eta}{dt}$,

where $M_z$ denotes the angular momentum with respect to the z-axis.
Therefore we have the relation

    $M_z = \mu (x\dot{y} - y\dot{x}) = x p_y - y p_x$,    (58)

which is of extreme importance in classical mechanics, since it enables 
us to express angular momentum in terms of the rectangular coordinates 
and their associated momenta. 

Now in deriving the S. equation from the expression for the energy in 
terms of rectangular coordinates and associated momenta, we set 

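    $p_x = \frac{h}{2\pi i} \frac{\partial}{\partial x}$ and $p_y = \frac{h}{2\pi i} \frac{\partial}{\partial y}$.

We therefore conclude that in quantum mechanics we are justified in
assuming that $M_z$ may be used as an operator, which is defined by the
relation

    $M_z = \frac{h}{2\pi i}\left(x \frac{\partial}{\partial y} - y \frac{\partial}{\partial x}\right)$.    (59)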



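Now let us assume that we have any function F which is a function of
the polar coordinates $\eta$ and r, or of the coordinates x and y. We have
the following relations between the differential coefficients:

    $\frac{\partial F}{\partial \eta} = \frac{\partial F}{\partial x} \frac{\partial x}{\partial \eta} + \frac{\partial F}{\partial y} \frac{\partial y}{\partial \eta}$

and

    $\frac{\partial F}{\partial r} = \frac{\partial F}{\partial x} \frac{\partial x}{\partial r} + \frac{\partial F}{\partial y} \frac{\partial y}{\partial r}$.

Substituting from (i) and (ii) it follows that

    $\frac{\partial F}{\partial \eta} = x \frac{\partial F}{\partial y} - y \frac{\partial F}{\partial x} = \left(x \frac{\partial}{\partial y} - y \frac{\partial}{\partial x}\right) F$.    (iv)

Hence we conclude, by comparing (iv) with (59), that as an operator

    $M_z = \frac{h}{2\pi i} \frac{\partial}{\partial \eta}$.    (60)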



Thus, in the case of the rotator with fixed axis, the normalized
function, as given by equation (19), is

    $Z = \frac{1}{\sqrt{2\pi}}\, e^{\pm i m \eta}$.

Therefore, in order to determine whether the angular momentum
has a definite value, we consider the equation

    $M_z Z = \frac{h}{2\pi i} \frac{\partial Z}{\partial \eta} = \pm \frac{mh}{2\pi}\, Z$.    (61)

That is, $M_z$ operating on the function Z yields as a result a constant
multiplied by Z. Hence, we conclude that an experiment arranged to
determine the magnitude and sign of the angular momentum would lead
to a value $\pm mh/(2\pi)$, depending on the relation between the direction
of rotation and that of the perturbing field. Here the result deduced by 
the operator method is identical with that deduced by ordinary me- 
chanics, when the quantizing principle is introduced. As stated pre- 
viously, the observations on band spectra show that it is not correct 
to treat the problem of a rotating molecule as one in only two dimen- 
sions. It is therefore necessary to determine what the form of the 
operator must be for the case of a rotator in three dimensions. 
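
The eigenvalue relation for the rotator with fixed axis can be checked numerically before the three-dimensional case is taken up. The sketch below applies the operator of equation (60) to $e^{im\eta}/\sqrt{2\pi}$ by a central finite difference; the unit $h/(2\pi) = 1$, the step size, and the function names are assumptions made for illustration.

```python
import cmath
import math

H_OVER_2PI = 1.0  # angular momentum measured in units of h / (2 pi)


def rotator_function(m, eta):
    """Normalized function exp(i m eta) / sqrt(2 pi) of the rotator with fixed axis."""
    return cmath.exp(1j * m * eta) / math.sqrt(2 * math.pi)


def apply_mz(f, eta, step=1e-5):
    """Apply M_z = (h / 2 pi i) d/d(eta) using a central difference."""
    derivative = (f(eta + step) - f(eta - step)) / (2 * step)
    return H_OVER_2PI / 1j * derivative


def eigenvalue_estimate(m, eta):
    """Ratio (M_z Z) / Z, which should be the constant m for every eta."""
    return apply_mz(lambda t: rotator_function(m, t), eta) / rotator_function(m, eta)


if __name__ == "__main__":
    for m in (-3, -1, 0, 2):
        print(m, eigenvalue_estimate(m, 0.7))
```

The ratio is independent of $\eta$ and equal to m (positive or negative according to the sign of the exponent), which is what is meant by saying that the angular momentum has a definite value in this state.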

This problem is somewhat more complicated because, as is evident
from equation (53), the angular momentum terms for $\theta$ and $\eta$ do not
enter into the expression for E in the same manner. It turns out¹² that
under these circumstances it is most convenient to calculate the square
of the resultant angular momentum vector $M^2$, which is defined thus:

    $M^2 = M_x^2 + M_y^2 + M_z^2$,

and as an operator, this is defined by the relation

    $M^2 = -\frac{h^2}{4\pi^2}\left[\left(y \frac{\partial}{\partial z} - z \frac{\partial}{\partial y}\right)^2 + \left(z \frac{\partial}{\partial x} - x \frac{\partial}{\partial z}\right)^2 + \left(x \frac{\partial}{\partial y} - y \frac{\partial}{\partial x}\right)^2\right]$.

12 This is discussed more fully by E. C. Kemble, Phys. Rev., Suppl., 1, 157 (1929);
also see J. Frenkel, "Wellenmechanik," pp. 248-253 (1929 edition).

Introducing spherical coordinates, and proceeding as in the case of
the single variable $\eta$, it may be shown that the differential equation for
the determination of $M^2$ is of the form

    $-\frac{h^2}{4\pi^2}\left[\frac{1}{\sin\theta} \frac{\partial}{\partial \theta}\left(\sin\theta \frac{\partial \phi}{\partial \theta}\right) + \frac{1}{\sin^2\theta} \frac{\partial^2 \phi}{\partial \eta^2}\right] = M^2 \phi$.    (62)

That is, $M^2$ as an operator has the form

    $M^2 = -\frac{h^2}{4\pi^2}\, B$,

where B is an operator of the same type as the Laplacian operator $\nabla^2$.
In fact, equation (4) may be written in the form

    $\nabla^2 = \frac{1}{r^2} \frac{\partial}{\partial r}\left(r^2 \frac{\partial}{\partial r}\right) + \frac{1}{r^2}\, B$.

Equation (62) may therefore be written in the form

    $B\phi + \frac{4\pi^2 M^2}{h^2}\, \phi = 0$.    (63)

Similarly, equation (11) may be written as

    $B\phi + \frac{8\pi^2 I E}{h^2}\, \phi = 0$,

the solution of which, as shown previously, is

    $E = \frac{k(k + 1) h^2}{8\pi^2 I}$.

Hence, the solution of (63) must also be the same. That is,

    $M^2 = \frac{k(k + 1) h^2}{4\pi^2}$,    (64)

while, as deduced in equation (61),

    $M_z = \pm \frac{mh}{2\pi}$.    (65)

Equation (64) leads to the conclusion that the total angular
momentum of the rotating system may be designated by a vector whose
magnitude is

    $\frac{h}{2\pi} \sqrt{k(k + 1)}$,

and that the component of this vector along the z-axis is given by $mh/2\pi$.
In terms of a unit vector of magnitude $h/(2\pi)$, the total angular
momentum and the component of this vector along the z-axis are therefore
$\sqrt{k(k + 1)}$ and m, respectively.

These results are specially significant in connection with the problem 
of the hydrogen atom, which is discussed in the following chapter. 



SUPPLEMENTARY NOTE 1 

EXPANSION OF AN ARBITRARY FUNCTION IN TERMS OF AN ORTHOG- 
ONAL SYSTEM OF FUNCTIONS 

In subsequent discussions we shall have occasion to make use of the 
very important property of orthonormalized functions which is ex- 
pressed in the form 



    $\int \phi_n \phi_m\, dv = 0 \quad (m \neq n)$
    $\qquad\qquad\quad\; = N^2 \quad (m = n)$,

where $\phi_n$ and $\phi_m$ are any two eigenfunctions of the system, N is the
normalizing factor, dv is the element of volume, area, or length, and the 
integration is extended over the whole region in which the functions are 
physically significant. (This region is usually designated the " con- 
figuration space.") 

As has been mentioned previously, the simplest type of such expres- 
sions is the trigonometric functions 

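    $\frac{1}{\sqrt{\pi}} \sin m\theta$ and $\frac{1}{\sqrt{\pi}} \cos n\theta$,

for which the limits are 0 and $2\pi$, so that

    $\int_0^{2\pi} \cos m\theta \cos n\theta\, d\theta = \int_0^{2\pi} \sin m\theta \sin n\theta\, d\theta = 0 \quad (m \neq n)$
    $\qquad\qquad = \pi \quad (m = n)$.    (i)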

By means of this relation it becomes possible to develop any given 
function of $\theta$ in terms of the sines or of the cosines of multiples of $\theta$.
Series involving only these trigonometric functions, that is, series of the
form

    $a_0 + a_1 \cos x + a_2 \cos 2x + \dots + a_n \cos nx + \dots$
    $\quad + b_1 \sin x + b_2 \sin 2x + \dots + b_m \sin mx + \dots$

are known as Fourier's series. The possibility of expressing any
arbitrary function of $\theta$ in terms of such a series may be illustrated by the
following examples.

(1) It is required to develop the function $f(\theta) = \theta$ as a sine series for
the region $\theta = 0$ to $\theta = \pi$. We have

    $\theta = a_1 \sin\theta + a_2 \sin 2\theta + \dots + a_n \sin n\theta + \dots$

Let us multiply each side of this equation by $\sin n\theta\, d\theta$ and integrate
between 0 and $\pi$. Then

    $\int_0^{\pi} \theta \sin n\theta\, d\theta = a_1 \int_0^{\pi} \sin\theta \sin n\theta\, d\theta + \dots + a_n \int_0^{\pi} \sin^2 n\theta\, d\theta + \dots$

FIG. 36. Illustrating Fourier's series analysis.

Because of equation (i) all the terms on the right-hand side, except
the one involving $\sin^2 n\theta$, vanish. Hence we obtain a relation for
determining $a_n$, which is of the form

    $a_n = \frac{2}{\pi} \int_0^{\pi} \theta \sin n\theta\, d\theta$.    (ii)

Now

    $d(\theta \cos n\theta) = -n\theta \sin n\theta\, d\theta + \cos n\theta\, d\theta$
    $\qquad\qquad\quad = -n\theta \sin n\theta\, d\theta + \frac{1}{n}\, d(\sin n\theta)$.

Hence,

    $\int_0^{\pi} \theta \sin n\theta\, d\theta = -\frac{1}{n} \Big[\theta \cos n\theta\Big]_0^{\pi} + \frac{1}{n^2} \Big[\sin n\theta\Big]_0^{\pi} = -\frac{\pi}{n} (-1)^n$,

since $\sin n\theta = 0$ for both $\theta = 0$ and $\theta = \pi$, while $\cos n\pi = (-1)^n$.

In a similar manner all the other coefficients, $a_1$, $a_2$, etc., may be
determined and the required development has the form

    $\theta = 2\left(\frac{\sin\theta}{1} - \frac{\sin 2\theta}{2} + \frac{\sin 3\theta}{3} - \dots\right)$.    (iii)

Figure 36 shows in the left-hand series of plots, the straight line $y = \theta$
from $\theta = 0$ to $\theta = \pi$ (that is from 0 to 180 degrees), and the successive
approximations to this line which are obtained by taking

    $y_1 = 2 \sin\theta$;

    $y_2 = 2 \sin\theta - \sin 2\theta$;

    $y_3 = 2 \sin\theta - \sin 2\theta + \frac{2}{3} \sin 3\theta$.

It will be observed that while the series does not converge very rapidly,
the curves gradually approximate $y = \theta$ more and more closely, with
increase in number of terms.
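
The coefficients in this first example can be confirmed by evaluating the integral of equation (ii) numerically. The sketch below is illustrative (Simpson's rule, hypothetical function names) and also evaluates the partial sums of the sine series at a chosen angle.

```python
import math


def simpson(f, a, b, n=2000):
    """Composite Simpson rule with n (even) sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def sine_coefficient(n):
    """a_n = (2/pi) * integral over 0..pi of theta sin(n theta), equation (ii)."""
    return 2 / math.pi * simpson(lambda t: t * math.sin(n * t), 0.0, math.pi)


def partial_sum(theta, terms):
    """Sum of the first `terms` terms of the sine series for f(theta) = theta."""
    return sum(2 * (-1) ** (n + 1) / n * math.sin(n * theta)
               for n in range(1, terms + 1))


if __name__ == "__main__":
    print([round(sine_coefficient(n), 6) for n in range(1, 5)])
    for terms in (1, 2, 3, 10, 100):
        print(terms, partial_sum(math.pi / 2, terms))
```

The computed coefficients are 2, -1, 2/3, -1/2, and so on, and the partial sums at $\theta = \pi/2$ approach $\pi/2$ only slowly, consistent with the remark about the rate of convergence.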

(2) It is desired to express $x^2$ as a cosine function for the range
$x = -c$ to $x = c$.

We introduce a new variable, $z = \pi x/c$, so that $z = -\pi$ for $x = -c$,
and $z = \pi$ for $x = c$. Then,

    $x^2 = \frac{c^2 z^2}{\pi^2} = a_0 + a_1 \cos z + a_2 \cos 2z + \dots$    (iv)

Multiplying each side by $\cos nz\, dz$ and integrating between the
limits, all coefficients on the right-hand side, except $a_n \int_{-\pi}^{\pi} \cos^2 nz\, dz$,
vanish, and we obtain the relation for determining $a_n$, of the form

    $\frac{c^2}{\pi^2} \int_{-\pi}^{\pi} z^2 \cos nz\, dz = a_n \int_{-\pi}^{\pi} \cos^2 nz\, dz = a_n \pi$,

that is,

    $a_n = \frac{c^2}{\pi^3} \int_{-\pi}^{\pi} z^2 \cos nz\, dz$.    (v)

Now

    $d(z^2 \sin nz) = n z^2 \cos nz\, dz + 2z \sin nz\, dz$;

    $d(z \cos nz) = -n z \sin nz\, dz + \cos nz\, dz$.

Hence,

    $\int_{-\pi}^{\pi} z^2 \cos nz\, dz = \left[\frac{z^2}{n} \sin nz\right]_{-\pi}^{\pi} - \frac{2}{n} \int_{-\pi}^{\pi} z \sin nz\, dz$

    $\qquad = -\frac{2}{n} \int_{-\pi}^{\pi} z \sin nz\, dz$ (since $\sin n\pi = 0$)

    $\qquad = \frac{2}{n^2} \Big[z \cos nz\Big]_{-\pi}^{\pi} - \frac{2}{n^2} \int_{-\pi}^{\pi} \cos nz\, dz$

    $\qquad = \frac{4\pi (-1)^n}{n^2} - \frac{2}{n^2} \left[\frac{\sin nz}{n}\right]_{-\pi}^{\pi} = \frac{4\pi (-1)^n}{n^2}$.    (vi)

Therefore,

    $a_n = \frac{4 c^2 (-1)^n}{\pi^2 n^2}$.    (vii)

The coefficient $a_0$ is determined from the relation

    $\frac{c^2}{\pi^2} \int_{-\pi}^{\pi} z^2\, dz = \int_{-\pi}^{\pi} a_0\, dz$, since $\cos 0 = 1$.

That is,

    $\frac{2 c^2 \pi}{3} = 2\pi a_0$, or $a_0 = \frac{c^2}{3}$.

Consequently, the development for $x^2$ is of the form

    $x^2 = \frac{c^2}{3} - \frac{4c^2}{\pi^2}\left(\cos\frac{\pi x}{c} - \frac{1}{2^2} \cos\frac{2\pi x}{c} + \frac{1}{3^2} \cos\frac{3\pi x}{c} - \dots\right)$.    (viii)

The curves on the right-hand side of Fig. 36 show the parabola
$y = x^2$ at the top, and the straight line $y = c^2/3 = 2^2/3$, which is
evidently the average value of y over the range $0 \le |x| \le 2$. The
other curves correspond to the expressions:

    $y_1 = \frac{c^2}{3} - \frac{4c^2}{\pi^2} \cos\frac{\pi x}{c}$;

    $y_2 = y_1 + \frac{c^2}{\pi^2} \cos\frac{2\pi x}{c}$;

    $y_3 = y_2 - \frac{4c^2}{9\pi^2} \cos\frac{3\pi x}{c}$.

As shown by the plot of $y_3$, this expression corresponds fairly closely
to $y = x^2$ for the range $x = \pm 0.75c$, and, by using more terms, the
range over which the series represents the parabolic function may be
made to approach the limits $x = \pm c$ very satisfactorily.
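
The cosine development of equation (viii) can be evaluated directly. The following sketch uses c = 2, as in Fig. 36; the function name and the sample points are illustrative assumptions.

```python
import math


def cosine_partial_sum(x, c, terms):
    """Partial sum of (viii): c^2/3 + sum of 4 c^2 (-1)^n / (pi^2 n^2) * cos(n pi x / c)."""
    total = c * c / 3
    for n in range(1, terms + 1):
        a_n = 4 * c * c * (-1) ** n / (math.pi ** 2 * n * n)
        total += a_n * math.cos(n * math.pi * x / c)
    return total


if __name__ == "__main__":
    c = 2.0
    for x in (0.0, 0.5, 1.0, 1.5, 2.0):
        print(x, x * x, cosine_partial_sum(x, c, 3), cosine_partial_sum(x, c, 50))
```

With three terms the agreement is already reasonable in the interior of the range and poorest near $x = \pm c$, where many more terms are needed.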

More generally, any function f(x) can be expressed within a definite
range of values of x, $-c \le x \le c$, in the form

    $f(x) = a_0 + \sum_{n=1}^{\infty} a_n \cos\frac{n\pi x}{c} + \sum_{n=1}^{\infty} b_n \sin\frac{n\pi x}{c}$,

where

    $a_n = \frac{1}{\pi} \int_{-c}^{c} f(x) \cos\frac{n\pi x}{c} \cdot \frac{\pi}{c}\, dx$;

    $a_0 = \frac{1}{2\pi} \int_{-c}^{c} f(x) \cdot \frac{\pi}{c}\, dx$;

    $b_n = \frac{1}{\pi} \int_{-c}^{c} f(x) \sin\frac{n\pi x}{c} \cdot \frac{\pi}{c}\, dx$.

Such a representation in terms of trigonometric functions is known
as a "Fourier's series" expansion for f(x).

The possibility of obtaining such developments of an arbitrary func- 
tion depends, evidently, upon the existence of the orthogonality relation 
expressed by equation (i). The same type of reasoning may be applied
to develop an arbitrary function of x between the limits 0 and $\infty$
in terms of a series of Hermitian or Laguerre polynomials.

Thus if f(x) is a function which tends to vanish for $x = \infty$, we can
obtain the coefficients $a_n$ in the series

    $f(x) = \sum_n a_n e^{-x^2/2} H_n(x)$

from the relation

    $a_n \int_0^{\infty} e^{-x^2} \{H_n(x)\}^2\, dx = 2^{n-1}\, n!\, \sqrt{\pi}\; a_n = \int_0^{\infty} f(x)\, e^{-x^2/2} H_n(x)\, dx$.    (ix)

Similarly, $f(\theta)$, an arbitrary¹³ function of $\theta$, may be represented in the
range $\pi > \theta > 0$ by the series of Legendre coefficients of zero order, in
the form

    $f(\theta) = A_0 P_0(\cos\theta) + A_1 P_1(\cos\theta) + \dots + A_n P_n(\cos\theta) + \dots$

where

    $A_n = \frac{2n + 1}{2} \int_0^{\pi} f(\theta)\, P_n(\cos\theta) \sin\theta\, d\theta$.    (x)

13 "Arbitrary" in the sense that it is possible to plot the function graphically.

Even if the integral in equation (ix) or (x) cannot be calculated by 
direct integration, it can always be evaluated by plotting the integrand 
(that is, the expression to be integrated) as a function of x or $\theta$ and
determining the area under the plot graphically. However, it is 
usually possible for the experienced mathematician to develop a con- 
vergent series for the integral, by means of which its actual value may be 
determined. 
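
The numerical evaluation just mentioned can be sketched for the Legendre development. The example assumes the standard formula $A_n = \frac{2n+1}{2}\int_0^{\pi} f(\theta) P_n(\cos\theta)\sin\theta\, d\theta$ for the coefficients in equation (x) and takes $f(\theta) = \cos^2\theta$ as a test function chosen for illustration; the Legendre polynomials are generated by the usual three-term recurrence.

```python
import math


def legendre(n, u):
    """Legendre polynomial P_n(u) from (j+1) P_{j+1} = (2j+1) u P_j - j P_{j-1}."""
    p_prev, p = 1.0, u
    if n == 0:
        return p_prev
    for j in range(1, n):
        p_prev, p = p, ((2 * j + 1) * u * p - j * p_prev) / (j + 1)
    return p


def simpson(f, a, b, n=2000):
    """Composite Simpson rule with n (even) sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def legendre_coefficient(f, n):
    """A_n for a function f(theta) on 0..pi, evaluated by numerical quadrature."""
    integrand = lambda t: f(t) * legendre(n, math.cos(t)) * math.sin(t)
    return (2 * n + 1) / 2 * simpson(integrand, 0.0, math.pi)


if __name__ == "__main__":
    f = lambda t: math.cos(t) ** 2
    print([round(legendre_coefficient(f, n), 6) for n in range(5)])
```

For this test function only $A_0 = 1/3$ and $A_2 = 2/3$ differ from zero, which reproduces the identity $\cos^2\theta = \frac{1}{3}P_0 + \frac{2}{3}P_2$.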

More generally, if an arbitrary function $\psi$ can be developed in terms
of an orthonormalized set of functions $\phi_0, \phi_1, \dots, \phi_n$, such that

    $\psi = \sum_n a_n \phi_n$,    (xi)

then

    $a_n = \int \psi\, \phi_n\, dv$,    (xii)

where the integration is carried out over the configuration space.

In case $\psi$ is a complex function, then $\phi_n$ must be replaced by $\bar{\phi}_n$, so
that $a_n$ is real.

CHAPTER VII
THE HYDROGEN ATOM

7.1 Bohr Theory of the Hydrogen-Like Atom. In Chapter IV it
was shown that, on the basis of the Bohr theory, the discrete energy
states of the hydrogen-like atom (of nuclear charge +Ze) are determined,
in accordance with the Wilson-Sommerfeld quantum conditions,
by the relation

    $E_n = -\frac{2\pi^2 \mu Z^2 e^4}{n^2 h^2}$    (1)

    $\quad\;\; = -\frac{RchZ^2}{n^2}$,    (2)

where $R = \frac{2\pi^2 \mu e^4}{c h^3}$ is known as the Rydberg constant.¹

Furthermore, it was shown that, for any given value of the total
quantum number n, the number of possible electronic orbits is determined
by the series of values for the azimuthal quantum number k = n,
n - 1, . . . 1.

For any state of quantum number n, the semi-major axis is given by
the relation

    $a = -\frac{Z e^2}{2 E_n} = n^2 a_1$,    (3a)

where

    $a_1 = \frac{h^2}{4\pi^2 \mu Z e^2}$    (3b)

is the radius of the Bohr orbit for the state n = 1. For the case Z = 1,
n = 1, that is, the normal state of the hydrogen atom, the radius is
designated by

    $a_0 = \frac{h^2}{4\pi^2 \mu e^2}$    (3c)

and is known as the radius of the normal Bohr orbit.

1 It is customary to designate this particular value of the Rydberg constant by
$R_\infty$. As pointed out in Chapter IV, it is necessary in calculating $E_n$ to take into
account the motion of the nucleus as well as that of the electron about their common
center of gravity. In that case $\mu$ corresponds to the "reduced mass," which becomes
equal to that of the electron for infinite nuclear mass.

Substituting the values for the constants in equations (2) and (3c),
the results obtained are

    $a_0 = 0.5282 \times 10^{-8}$ cm.,
and $R_\infty = 109737.42$ cm.$^{-1}$
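
The substitution can be repeated with present-day values of the constants. The sketch below assumes the standard Bohr expressions $a_0 = h^2/(4\pi^2 \mu e^2)$ and $R_\infty = 2\pi^2 \mu e^4/(c h^3)$ in Gaussian (c.g.s.) units and uses modern constants that are not taken from the text; with them $a_0$ comes out near $0.5292 \times 10^{-8}$ cm, slightly different from the figure quoted above, which rests on the constants accepted at the time of writing.

```python
import math

# Modern c.g.s. (Gaussian) values; assumptions, not the constants used in the text.
PLANCK_H = 6.62607015e-27        # erg s
ELECTRON_MASS = 9.1093837e-28    # g
ELECTRON_CHARGE = 4.80320471e-10 # e.s.u.
LIGHT_SPEED = 2.99792458e10      # cm / s


def bohr_radius():
    """a_0 = h^2 / (4 pi^2 mu e^2), with mu taken as the electron mass."""
    return PLANCK_H ** 2 / (4 * math.pi ** 2 * ELECTRON_MASS * ELECTRON_CHARGE ** 2)


def rydberg_constant():
    """R = 2 pi^2 mu e^4 / (c h^3) for infinite nuclear mass, in cm^-1."""
    return (2 * math.pi ** 2 * ELECTRON_MASS * ELECTRON_CHARGE ** 4
            / (LIGHT_SPEED * PLANCK_H ** 3))


def bohr_energy(n, z=1):
    """E_n = -R c h Z^2 / n^2 in ergs, equation (2)."""
    return -rydberg_constant() * LIGHT_SPEED * PLANCK_H * z * z / (n * n)


if __name__ == "__main__":
    print("a_0 =", bohr_radius(), "cm")
    print("R   =", rydberg_constant(), "cm^-1")
    print("E_1 =", bohr_energy(1), "erg")
```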

7.2 Hydrogen Atom as a Potential Barrier Problem. Before dis- 
cussing the solution of the appropriate S. equation for this problem, 
it is of interest to point out a method by which an approximate solution 
may be derived for the values of the series of discrete energy states. 

As N. F. Mott has pointed out, 2 " a hydrogen atom is simply an 
electron bound by an electrostatic force, which pulls it back if it tries 
to get away from the nucleus. The wave function therefore will vibrate 
in normal modes, and we shall only have wave functions describing the 
behavior of electrons of certain discrete energies." 

An approximate calculation of the magnitude of these energy states 
may be made as follows: 

It was shown in Chapter III that, in the case of an electron in a po- 
tential " box," the series of discrete energy states is given by 

ra 



where m is an integral value and 2a is the extent of the region between 
the barriers. Also it was shown that, for the lowest of these states 
(m = 1), the corresponding characteristic function is 



(3.34) 
where 

X = Jl - 4a. (S.32) 




In the case of a hydrogen atom, the field of force is defined by the
potential energy $-Ze^2/r$. Since the total energy cannot exceed this
value for any given value of r, it follows, if 2a denote the diameter of the
electron orbit for the lowest state, that

    $E_1 = -\frac{Z e^2}{a}$    (i)

and

    $\lambda = 4a = \frac{h}{\sqrt{2\mu |E_1|}}$.    (ii)

2 N. F. Mott, "An Outline of Wave Mechanics," p. 63.

From (i) and (ii) it follows, by eliminating a, that

    $E_1 = -\frac{32 \mu Z^2 e^4}{h^2}$,

which differs from the expression in (1) by the ratio $32/(2\pi^2)$. If $\lambda$
had been assumed equal to $\pi a$, instead of 4a, the two results would be
identical.

7.3 The Schroedinger Equation for the Hydrogen-Like Atom. In
the Schroedinger method of calculating energy states the starting point
is the same as in the classical method, that is, we consider a system
consisting of a nucleus of charge +Ze, and of an electron for which the
potential energy is given by $V = -Ze^2/r$, and the kinetic energy T, by
$\frac{1}{2}\mu v^2$. In terms of spherical polar coordinates, the total energy is
given by

    $E = \frac{\mu}{2}\left(\dot{r}^2 + r^2 \dot{\theta}^2 + r^2 \sin^2\theta\, \dot{\eta}^2\right) - \frac{Z e^2}{r}$.    (4)

As shown in equation (5.8), the S. equation deduced from this
relation is of the form

    $\nabla^2 \phi + \alpha^2 \left(E + \frac{Z e^2}{r}\right) \phi = 0$,    (5)

where $\alpha^2 = 8\pi^2 \mu/h^2$, and the Laplacian operator is given by the relation

    $\nabla^2 = \frac{1}{r^2}\left[\frac{\partial}{\partial r}\left(r^2 \frac{\partial}{\partial r}\right) + \frac{1}{\sin\theta} \frac{\partial}{\partial \theta}\left(\sin\theta \frac{\partial}{\partial \theta}\right) + \frac{1}{\sin^2\theta} \frac{\partial^2}{\partial \eta^2}\right]$.

The solution of this partial differential equation must yield $\phi = \phi(r, \theta, \eta)$,
and, as in the case of the rigid rotator, we assume that it is possible to
express this solution in the form

    $\phi = S(r) \cdot Y(\theta, \eta)$,

where S(r) is a function of the radius only, and $Y(\theta, \eta)$ denotes a function
of the angle variables.

We thus separate equation (5) into two differential equations, one
in r, and the other in $\theta$ and $\eta$, which are as follows:³

    $\frac{1}{\sin\theta} \frac{\partial}{\partial \theta}\left(\sin\theta \frac{\partial Y}{\partial \theta}\right) + \frac{1}{\sin^2\theta} \frac{\partial^2 Y}{\partial \eta^2} + C\, Y = 0$,    (6)

    $\frac{1}{r^2} \frac{d}{dr}\left(r^2 \frac{dS}{dr}\right) + \left[\alpha^2 \left(E + \frac{Z e^2}{r}\right) - \frac{C}{r^2}\right] S = 0$,    (7)

where C is an arbitrary constant which plays the same role in the present
case as the constant $m^2$, which was introduced in solving equation (6.11).
Equation (6) is evidently identical with equation (6.11), and the
solution is therefore a tesseral harmonic of the form⁴

    $Y(\theta, \eta) = P_l^m(\cos\theta)\, e^{i m \eta}$,

    $C = l(l + 1)$,⁵

where

    l = 0, 1, 2, etc.

and

    $m = \pm l, \pm(l - 1), \dots, \pm 1, 0$.

As pointed out in the case of the rigid rotator this signifies that the
particular energy state corresponding to any given value $E_l$ is degenerate,
inasmuch as it may be represented by any one of 2l + 1 independent
eigenfunctions. However, it will be observed that it is not possible from
the solution of equation (6) to determine the value of $E_l$. For this
purpose it is necessary to solve the radial equation (7), which, by
substituting the value for C, takes the form,

    $\frac{d^2 S}{dr^2} + \frac{2}{r} \frac{dS}{dr} + \left[A + \frac{2B}{r} - \frac{l(l + 1)}{r^2}\right] S = 0$,    (8)

where

    $A = \frac{8\pi^2 \mu E}{h^2}$

and

    $B = \frac{4\pi^2 \mu Z e^2}{h^2}$.    (9)

3 Compare section 6.4.

4 The tesseral harmonics are usually indicated by the symbol $Y_{l,m}$ where l and m
are the two integers used to designate the associated Legendre functions.

5 The symbol l is introduced instead of k which was used in the previous case, in
order to bring the notation into agreement with spectroscopic usage.

7.4 The Radial Schroedinger Equation. Laguerre Functions. In
solving the radial equation, we must be guided by the conclusions
deduced in ordinary mechanics about the motion of a body in a central 
field of force. The best illustration of this is furnished by the investi- 
gations on the possible orbits of the planets in the gravitational field 
of the sun. We know that in this case two types of orbits are possible, 
(1) hyperbolic orbits for which E > 0, and (2) elliptical (including 
circular) orbits, for which E < 0. In the quantum mechanics problem 
we must therefore seek solutions corresponding to these two cases. 

Let us consider first the case E > 0. For very large values of r,
all the terms in (8) involving 1/r or $1/r^2$ may be neglected, and the
equation becomes

    $\frac{d^2 S}{dr^2} + A S = 0$.    (10)

The solution of this equation is evidently

    $S = c_1 e^{i\sqrt{A}\, r} + c_2 e^{-i\sqrt{A}\, r}$.    (11)

According to (9),

    $\sqrt{A} = \frac{2\pi \sqrt{2\mu E}}{h} = \frac{2\pi}{\lambda}$,    (12)

where $\lambda$ is the de Broglie wave length for a particle of kinetic energy E.
From the discussion in Chapter II, it is seen that equation (11)
represents a combination of two streams, in one of which the particle is
receding from the origin, and in the other it is approaching the origin
with a constant momentum $\sqrt{2\mu E}$. Obviously E may vary
continuously from 0 to any positive value. That is, there are no discrete
energy states, and this gives an interpretation of the fact that beyond
the limits of the line spectrum (which is due to a series of discrete states
of negative energy) there is also observed a continuous spectrum,
which must correspond to states for which E > 0.

The much more interesting case is that for which E < 0, and for which, 
according to the classical Bohr theory, the system exhibits a series of 
electronic orbits, each of which is distinguished by definite quantum 
numbers and definite values of the orbital constants. 6 

Since E is negative, $-A$ in equation (9) is positive, and it follows
from equation (12) that the magnitude

    $a = \frac{1}{2\sqrt{-A}}$,    (13)

which is real, has the dimensions of a length.
We therefore introduce the dimensionless variable,

    $x = \frac{r}{a}$,    (14)

and consequently equation (8) becomes

    $\frac{d^2 S}{dx^2} + \frac{2}{x} \frac{dS}{dx} + \left[-\frac{1}{4} + \frac{2Ba}{x} - \frac{l(l + 1)}{x^2}\right] S = 0$.    (15)

6 The following discussion of the problem is based on that of M. v. Laue,
"Handbuch der Radiologie," Vol. VI, Part I, pp. 42-48. An elementary presentation is
also given by N. F. Mott, "An Outline of Wave Mechanics," pp. 71-77.

The problem is to find solutions of this equation which are finite and
continuous for all values of x and which vanish at $x = \infty$. The point
x = 0 is a singular point, since 1/x becomes infinite there. It is therefore
necessary to investigate the behavior of the solutions at this point
and, in order to do this, we assume that it is possible to express the
function S in the form,

    $S = x^s v$,    (16)

where s is a constant and v may be expressed as a polynomial in powers
of x, of the form

    $v = \sum_{\nu = 0}^{\infty} v_\nu x^\nu$.

Hence

    $S = v_0 x^s + v_1 x^{s+1} + \dots + v_n x^{s+n}$,

and

    $\frac{dS}{dx} = s v_0 x^{s-1} + (s + 1) v_1 x^s + \dots$,

while

    $\frac{d^2 S}{dx^2} = s(s - 1) v_0 x^{s-2} + (s + 1) s\, v_1 x^{s-1} + \dots$

Substituting in equation (15) it is found that the coefficient of $v_0 x^{s-2}$
is given by

    $s(s - 1) + 2s - l(l + 1) = s(s + 1) - l(l + 1)$.

As we approach x = 0, all terms involving higher powers of x than
s - 2 may be neglected, and if S is to be finite at x = 0, it is necessary
that this coefficient vanish; that is, the series must not have any lower
powers of x than those given by the relation

    $s(s + 1) - l(l + 1) = 0$.

It follows that s = l, or s = -(l + 1), and since $x^{-l-1}$ would become
infinite at x = 0, we must choose the value s = l. Using this value in
equation (16), substituting in (15), and dividing by $x^l$, we obtain the
equation

    $\frac{d^2 v}{dx^2} + \frac{2(l + 1)}{x} \frac{dv}{dx} + \left[-\frac{1}{4} + \frac{2Ba}{x}\right] v = 0$.    (17)

For very large values of x, all terms involving 1/x may be neglected,
and the equation becomes

    $\frac{d^2 v}{dx^2} - \frac{v}{4} = 0$,

the solution of which is

    $v = c_1 e^{x/2} + c_2 e^{-x/2}$.

Since v must not increase indefinitely with increase in x (which is
always positive), it follows that v should be of the form

    $v = e^{-x/2} g(x)$.    (18)

Substituting this function and its differential coefficients in (17)
and multiplying by $e^{x/2}$, the result is

    $\frac{d^2 g}{dx^2} + \left[\frac{2(l + 1)}{x} - 1\right] \frac{dg}{dx} + \left[\frac{2Ba - l - 1}{x}\right] g = 0$.    (19)

If we examine the behavior of the series

    $g(x) = \sum_{\nu} b_\nu x^\nu$

for large values of x in a manner similar to that used in the case of the
Hermitian and Legendre polynomials, it is found that v will decrease
exponentially and ultimately vanish for $x \to \infty$ only if the series is made
to end with the term in $x^j$, where j is determined by the relation

    $2Ba = j + l + 1$.

Substituting for B and a from equations (9) and (13) it follows from
the last equation that

    $E = -\frac{2\pi^2 \mu Z^2 e^4}{h^2 (j + l + 1)^2}$.    (20)

That is, solutions of the S. equation which are finite and continuous
for all values of x, and which vanish for $x = \infty$, are obtained only for
the series of discrete eigenvalues defined by the relation

    $E_n = -\frac{2\pi^2 \mu Z^2 e^4}{n^2 h^2}$,    (21)

where n = j + l + 1 = 2Ba

and $j \ge 0$, $l \ge 0$, while $n \ge 1$.

Equation (21) is identical with equation (1) derived by the Bohr
theory, and is in satisfactory agreement with observations on the
energy levels for hydrogen-like atoms.
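
The relation n = j + l + 1 fixes which pairs of integers belong to a given eigenvalue. The sketch below enumerates them and counts the independent eigenfunctions, using the 2l + 1 values of m noted in section 7.3; the energy is expressed in units of Rch according to equation (2), and the function names are illustrative.

```python
def allowed_pairs(n):
    """All (j, l) with j >= 0, l >= 0 and j + l + 1 = n."""
    return [(n - l - 1, l) for l in range(n)]


def degeneracy(n):
    """Number of independent eigenfunctions: sum of (2l + 1) over the allowed l."""
    return sum(2 * l + 1 for _, l in allowed_pairs(n))


def energy_in_rydbergs(n, z=1):
    """E_n in units of R c h, from E_n = -R c h Z^2 / n^2."""
    return -z * z / (n * n)


if __name__ == "__main__":
    for n in range(1, 5):
        print(n, allowed_pairs(n), degeneracy(n), energy_in_rydbergs(n))
```

Each eigenvalue $E_n$ is thus shared by n values of l, and the count of eigenfunctions comes out as $n^2$.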

Let us now consider the differential equation (17) which has the form

    $x \frac{d^2 v}{dx^2} + 2(l + 1) \frac{dv}{dx} + \left(n - \frac{x}{4}\right) v = 0$.    (22)

Let

    p = 2l + 1

and

    k = n + l.

Then

    j = n - l - 1
      = (n + l) - (2l + 1)
      = k - p,

and substituting for v in equation (22) from equation (18), the resulting
differential equation has the form

    $x \frac{d^2 g}{dx^2} + (p + 1 - x) \frac{dg}{dx} + (k - p)\, g(x) = 0$.    (23)

This equation is satisfied by the associated Laguerre polynomial of
degree $(k - p)$ and order $p$ $(p \le k)$, which is designated by $L_k^p(x)$.
It is analogous to the associated Legendre polynomial of degree $k$
and order $m$, $P_k^m(x)$, and is derived in a similar manner from the Laguerre
polynomial of zero order defined thus:

$$L_k(x) = e^x \frac{d^k}{dx^k}\left(x^k e^{-x}\right). \tag{24}$$

This function, it will be observed, bears a distinct similarity to the
Hermitian polynomial defined in equation (5.18). It is a polynomial
of degree $k$ in $x$ which is given by the following series:

$$L_k(x) = \sum_{\nu=0}^{k} (-1)^\nu \frac{(k!)^2}{(k-\nu)!\,(\nu!)^2}\, x^\nu = (-1)^k \left[x^k - \frac{k^2}{1!} x^{k-1} + \frac{k^2 (k-1)^2}{2!} x^{k-2} - \cdots + (-1)^k k!\right] \tag{25}$$

and satisfies the differential equation,

$$x\frac{d^2 L_k}{dx^2} + (1 - x)\frac{dL_k}{dx} + k L_k = 0.$$

From this, by differentiating $p$ times, equation (23) is derived, which is
satisfied by the associated Laguerre function,

$$L_k^p(x) = \frac{d^p}{dx^p} L_k(x).$$

For $k = n + l$, and $p = 2l + 1$, the associated Laguerre polynomial
of degree $n - l - 1$ $(= k - p)$ and of order $2l + 1$ is given
by the series

$$L_{n+l}^{2l+1}(x) = -[(n+l)!]^2 \left\{\frac{1}{(n-l-1)!\,(2l+1)!} - \frac{x}{(n-l-2)!\,(2l+2)!\,1!} + \frac{x^2}{(n-l-3)!\,(2l+3)!\,2!} - \cdots \right\}. \tag{26}$$



The first ten of these polynomials are as follows: 7

$L_0(x) = 1$,

$L_1(x) = -x + 1$; $L_1^1(x) = -1$,

$L_2(x) = x^2 - 4x + 2$; $L_2^1(x) = 2x - 4$,

$L_2^2(x) = 2$,

$L_3(x) = -x^3 + 9x^2 - 18x + 6$; $L_3^1(x) = -3x^2 + 18x - 18$,

$L_3^2(x) = -6x + 18$,

$L_3^3(x) = -6$.
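
The definitions above can be checked mechanically. The sketch below builds the coefficients of $L_k(x)$ from the series (25) and differentiates $p$ times to obtain $L_k^p(x)$; the function names and the list-of-coefficients representation (lowest power first) are illustrative choices, not notation from the text.

```python
from math import factorial


def laguerre(k):
    """Coefficients [c0, c1, ..., ck] of L_k(x), taken from series (25)."""
    return [(-1) ** v * factorial(k) ** 2 // (factorial(k - v) * factorial(v) ** 2)
            for v in range(k + 1)]


def derivative(coeffs, p=1):
    """Differentiate a polynomial (coefficient list, lowest power first) p times."""
    for _ in range(p):
        coeffs = [i * c for i, c in enumerate(coeffs)][1:]
    return coeffs


def associated_laguerre(k, p):
    """Coefficients of L_k^p(x) = d^p/dx^p L_k(x), for p <= k."""
    return derivative(laguerre(k), p)


# L_3(x) = 6 - 18x + 9x^2 - x^3 and its first derivative L_3^1(x).
l3 = laguerre(3)
l3_1 = associated_laguerre(3, 1)
```

Working with integer coefficient lists keeps the arithmetic exact, so the results can be compared directly with the listed polynomials.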
7 Condon and Morse, "Quantum Mechanics," p. 63.

Hence the solution of equation (17) is

$$v = e^{-x/2} L_{n+l}^{2l+1}(x),$$

and consequently the solution of the S. radial equation (15) is

$$S_{nl}(x) = x^l v_{nl} = x^l e^{-x/2} L_{n+l}^{2l+1}(x), \tag{27}$$

where, according to equations (9), (13), and (14),

$$x = \frac{r}{\alpha} = 2r\sqrt{-\frac{8\pi^2 \mu E}{h^2}}, \tag{28}$$

and $n$ and $l$ are the two integers designating the eigenfunctions $S(x)$.
It is most convenient to measure $r$ in terms of the radius of the Bohr
orbit in the normal state of the hydrogen atom. Introducing the
expression of $E_n$, given in equation (21), and the value of $a_0$, defined by
(3c), into equation (28), the result is

$$x = \frac{2Zr}{n a_0}, \tag{29}$$

and in terms of $a_0$, the eigenvalues are given by

$$E_n = -\frac{Z^2 e^2}{2 a_0 n^2}. \tag{30}$$

The associated Laguerre polynomials form an orthogonal system,
and in the present case it may be shown that the complete expression
which must be normalized is of the form 8

$$\int_0^\infty e^{-x} x^{2l} \left[L_{n+l}^{2l+1}(x)\right]^2 x^2\, dx = \frac{2n\,[(n+l)!]^3}{(n-l-1)!},$$

and that the normalizing factor is given by the relation

$$N_{nl} = \sqrt{\frac{(n-l-1)!}{2n\,[(n+l)!]^3}}. \tag{31}$$

If now, in accordance with equation (29), $x$ is replaced by $2Zr/(na_0)$,
the normalized radial function becomes

$$S_{nl}(r) = \sqrt{\left(\frac{2Z}{n a_0}\right)^3 \frac{(n-l-1)!}{2n\,[(n+l)!]^3}}\; e^{-\rho/2} \rho^l L_{n+l}^{2l+1}(\rho), \qquad \rho = \frac{2Zr}{n a_0}. \tag{32}$$

8 This is due to the fact, as shown in a subsequent section, that the function
$S^2(r)\, r^2$ is required for the physical interpretation.

Table 1 gives the expressions for the normalized functions $S_{nl}(\rho)$,
where $\rho = 2Zr/(na_0)$ is used as radial coordinate. The spectroscopic
designation of the corresponding energy "term" is given in the first
column.

TABLE 1

NORMALIZED RADIAL FUNCTIONS FOR THE ELECTRON IN HYDROGEN-LIKE ATOMS 9

ρ = 2Zr/(n a_0)

Spectroscopic
Notation    n   l   L_{n+l}^{2l+1}(ρ)      S_nl(ρ)

1s          1   0   -1                     -2 (Z/a_0)^{3/2} e^{-ρ/2}
2s          2   0   2ρ - 4                 [1/(4√2)] (Z/a_0)^{3/2} e^{-ρ/2} (2ρ - 4)
2p          2   1   -3!                    -[1/(2√6)] (Z/a_0)^{3/2} e^{-ρ/2} ρ
3s          3   0   -3(ρ² - 6ρ + 6)        -[1/(9√3)] (Z/a_0)^{3/2} e^{-ρ/2} (ρ² - 6ρ + 6)
3p          3   1   24(ρ - 4)              [1/(9√6)] (Z/a_0)^{3/2} e^{-ρ/2} ρ (ρ - 4)
3d          3   2   -5!                    -[1/(9√30)] (Z/a_0)^{3/2} e^{-ρ/2} ρ²

Figure 37 shows plots 10 of the eigenfunctions $S_{nl}(r)$ associated with
the values n = 1, 2, and 3 for the hydrogen atom (Z = 1), as functions
of $r/a_0$.

9 H. Bethe ("Handbuch der Physik," XXIV, Part I, p. 274) and Pauling and
Wilson ("Quantum Mechanics," Chapter V) give the expressions for the
eigenfunctions corresponding to values of n > 3, in which, in order to make the
functions positive for small values of r, the sign has been changed. Consequently,
the function designated by these writers as $R_{nl}(\rho)$ is identical with
$-S_{nl}(\rho)$ as defined by equation (32). Obviously this difference in sign does
not affect the value of the distribution function $\{S_{nl}(r)\, r\}^2$.

10 This figure is taken from the treatise by H. E. White, "Introduction to Atomic
Spectra." Plots of these functions were first published by L. A. Pauling, Proc. Roy.
Soc., A114, 181 (1927). See also the summary by the same author in Chem. Rev.,
5, 173 (1928), and L. Pauling and E. B. Wilson, Jr., "Introduction to Quantum
Mechanics," Chapter V.

FIG. 37. Plots of normalized radial functions for different electronic
states of hydrogen atom.

It will be observed that the number of zero points, or nodes (excluding
that at r = 0), along the axis of r (or ρ) is identical with the value
j = n - (l + 1). Hence, j is designated as the radial quantum number and
is often denoted by the symbol $n_r$. The comments of A. Sommerfeld
upon this result are significant. 11 He writes:

This leads to a simple wave mechanical interpretation of the quantum numbers; 
indeed, not only in the case of the radial coordinate and of the Kepler problem but 
in all cases where, using the polynomial method, we can apply the following defini- 
tion by forcibly breaking off an expansion in series, that is, by the degree of the result- 
ing polynomial: Quantum numbers denote the numbers of nodes in the proper functions 
that lie between the limiting points for the coordinate in question. This brings to mind 
the analogy of the vibrating string in which the ordinal number of an overtone is 
likewise measured by the number of nodes that lie between the fixed ends of the 
string. 
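
The normalization of equation (32) and the node count j = n - l - 1 can be verified numerically. The following is a minimal sketch under stated assumptions: r is measured in units of $a_0$, the integral of $S^2 r^2$ is approximated by a midpoint rule on a finite interval, and all function names are illustrative rather than taken from the text.

```python
from math import exp, factorial, sqrt


def assoc_laguerre(k, p):
    """Coefficients (lowest power first) of L_k^p(x) = d^p/dx^p L_k(x)."""
    coeffs = [(-1) ** v * factorial(k) ** 2 // (factorial(k - v) * factorial(v) ** 2)
              for v in range(k + 1)]
    for _ in range(p):
        coeffs = [i * c for i, c in enumerate(coeffs)][1:]
    return coeffs


def radial(n, l, r, Z=1):
    """S_nl(r) of equation (32), r in units of a_0, sign convention of Table 1."""
    rho = 2.0 * Z * r / n
    norm = sqrt((2.0 * Z / n) ** 3 * factorial(n - l - 1)
                / (2.0 * n * factorial(n + l) ** 3))
    poly = sum(c * rho ** i for i, c in enumerate(assoc_laguerre(n + l, 2 * l + 1)))
    return norm * exp(-rho / 2) * rho ** l * poly


def norm_integral(n, l, r_max=60.0, steps=20000):
    """Midpoint-rule estimate of the integral of S^2 r^2 dr from 0 to r_max."""
    h = r_max / steps
    return sum(radial(n, l, (i + 0.5) * h) ** 2 * ((i + 0.5) * h) ** 2
               for i in range(steps)) * h


def count_nodes(n, l, r_max=60.0, steps=20000):
    """Number of sign changes of S_nl(r) for r > 0 on the sampling grid."""
    h = r_max / steps
    values = [radial(n, l, (i + 0.5) * h) for i in range(steps)]
    return sum(1 for a, b in zip(values, values[1:]) if a * b < 0)


STATES = [(1, 0), (2, 0), (2, 1), (3, 0), (3, 1), (3, 2)]
```

For each state in `STATES`, `norm_integral` should come out close to unity and `count_nodes` should equal n - l - 1, in agreement with the statement that j counts the radial nodes.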

7.5 The Complete Solution of the S. Equation for the Hydrogen
Atom. As stated at the beginning of this chapter, the complete
solution of the S. equation for the hydrogen-like atom must be an expression
of the form

$$\phi_{nlm}(r, \theta, \eta) = S_{nl}(r)\, Y_{lm}(\theta, \eta),$$

where

$$Y_{lm}(\theta, \eta) = X_{lm}(\theta) \cdot Z_m(\eta)$$

is a tesseral harmonic, which was defined in equation (.43), with
the normalizing factor given by equation (5.46). Furthermore,
according to equations (6.19), (6.39), and (6.42), the individual
normalized functions of θ and η are given by the relations

$$X_{lm}(\theta) = \sqrt{\frac{(2l+1)\,(l-|m|)!}{2\,(l+|m|)!}}\; P_l^{|m|}(\cos\theta), \tag{33}$$

$$Z_m(\eta) = \frac{1}{\sqrt{2\pi}}\, e^{im\eta}. \tag{34}$$

11 Sommerfeld, "Wave Mechanics," English translation, p. 72.

Table 2 gives the expressions for these normalized functions for
values of l = 0, 1, and 2.

TABLE 2

NORMALIZED SPHERICAL EIGENFUNCTIONS

l    m     X_lm(θ)

0    0     √2/2
1    0     (√6/2) cos θ
1    ±1    (√3/2) sin θ
2    0     (√10/4) (3 cos² θ - 1)
2    ±1    (√15/2) sin θ cos θ
2    ±2    (√15/4) sin² θ

The function $Z_m(\eta)$ can be expressed in either the complex or real
form thus:

$$Z_1(\eta) = \frac{1}{\sqrt{2\pi}}\, e^{i\eta} \quad \text{or} \quad Z_{1\cos}(\eta) = \frac{1}{\sqrt{\pi}} \cos\eta$$

$$Z_{-1}(\eta) = \frac{1}{\sqrt{2\pi}}\, e^{-i\eta} \quad \text{or} \quad Z_{1\sin}(\eta) = \frac{1}{\sqrt{\pi}} \sin\eta$$

$$Z_2(\eta) = \frac{1}{\sqrt{2\pi}}\, e^{2i\eta} \quad \text{or} \quad Z_{2\cos}(\eta) = \frac{1}{\sqrt{\pi}} \cos 2\eta$$

$$Z_{-2}(\eta) = \frac{1}{\sqrt{2\pi}}\, e^{-2i\eta} \quad \text{or} \quad Z_{2\sin}(\eta) = \frac{1}{\sqrt{\pi}} \sin 2\eta$$

and so forth.
The change in the coefficient from $1/\sqrt{2\pi}$ to $1/\sqrt{\pi}$ takes care of the
normalization. By means of the expressions in Tables 1 and 2, it is
possible to calculate the expression for the normalized eigenfunction
$\phi_{nlm}$, corresponding to any given values of n, l, and m. The functions
for a few of the lowest states are as follows:

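For n = 1, l = 0, m = 0, which is the normal state for the
hydrogen-like atom,

$$\phi_{100} = -\frac{1}{\sqrt{\pi}} \left(\frac{Z}{a_0}\right)^{3/2} e^{-Zr/a_0}. \tag{35}$$

For n = 2, there are four eigenfunctions, one corresponding to l = 0,
and three to l = 1, as indicated in Table 2. The normalized functions
are as follows: 12

$$\phi_{200} = \frac{1}{4\sqrt{2\pi}} \left(\frac{Z}{a_0}\right)^{3/2} \left(\frac{Zr}{a_0} - 2\right) e^{-Zr/(2a_0)} \tag{36}$$

$$\phi_{210} = -\frac{1}{4\sqrt{2\pi}} \left(\frac{Z}{a_0}\right)^{5/2} r \cos\theta\; e^{-Zr/(2a_0)} \tag{37}$$

$$\phi_{211} = -\frac{1}{4\sqrt{2\pi}} \left(\frac{Z}{a_0}\right)^{5/2} r \sin\theta\; e^{-Zr/(2a_0)} \begin{cases} \cos\eta \\ \sin\eta \end{cases} \tag{38}$$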

From equation (20) it will be observed that all the eigenfunctions for
any given value of n have the same eigenvalue

$$E_n = -\frac{Z^2 e^2}{2 n^2 a_0}.$$

The state is, therefore, of the degenerate type. As was shown
previously, there are 2l + 1 functions corresponding to any given value
of l. But for a given value of n, l can have the values n - 1, n - 2,
..., 0. Hence the total number of eigenfunctions corresponding
to a given value of n is

$$\Sigma = \{2(n-1) + 1\} + \{2(n-2) + 1\} + \cdots + 3 + 1 = n^2.$$

Thus the degree of degeneracy for quantum number n is $n^2$. This
conclusion may be illustrated by the following table giving the different
possible values of the quantum numbers n, l, and m, and the
corresponding spectroscopic designations for the lower energy states of a
hydrogen-like atom.

12 A table of the hydrogen-like wave functions for the values n = 3 is given in
Pauling and Wilson, "Introduction to Quantum Mechanics," pp. 138-9, also in
Ruark and Urey, "Atoms, Molecules and Quanta," p. 664, and Sommerfeld, "Wave
Mechanics," p. 71.

Spectral        Designation     Quantum Number
Designation     of φ            n    l    m

1s              100             1    0    0
2s              200             2    0    0
2p              210             2    1    0
                211             2    1    ±1
3s              300             3    0    0
3p              310             3    1    0
                311             3    1    ±1
3d              320             3    2    0
                321             3    2    ±1
                322             3    2    ±2

The integer n is known as the total quantum number. The number l,
which is associated with the azimuthal angle θ, is designated the azimuthal
quantum number, while m, which is associated with the angle η, is
designated the magnetic quantum number.
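
The counting argument can be reproduced by direct enumeration. This short sketch lists every allowed triple (n, l, m) for a given n and forms the spectroscopic label; the helper names and the letter string "spdf" (which covers only l = 0 to 3) are illustrative assumptions.

```python
LETTERS = "spdf"  # spectroscopic letters for l = 0, 1, 2, 3 only


def states(n):
    """All (n, l, m) with l = 0 .. n-1 and m = -l .. +l."""
    return [(n, l, m) for l in range(n) for m in range(-l, l + 1)]


def label(n, l):
    """Spectroscopic designation such as '2p' or '3d'."""
    return f"{n}{LETTERS[l]}"


# Degree of degeneracy for n = 1, 2, 3: the number of eigenfunctions per level.
degeneracy = {n: len(states(n)) for n in range(1, 4)}
```

Here `degeneracy` evaluates to `{1: 1, 2: 4, 3: 9}`, in agreement with the value n² derived above.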

7.6 Quantum Mechanics Interpretation of the Characteristic
Functions. In quantum mechanics the eigenfunction φ, obtained by solving
a S. equation, has no direct physical interpretation. But $\bar{\phi}\phi$, or $\phi^2$
(if the function is real), is interpreted as representing a probability of
occurrence. In the case of the hydrogen atom, $\phi_{nlm}(r, \theta, \eta)$ is a function
of three coordinates. Omitting the subscripts and considering only
the normalized functions, the expression

$$\bar{\phi}\phi\, dv$$

is regarded as denoting the probability of occurrence of the electron in
the element of volume dv, at the point whose coordinates are r, θ, and η.
As shown in Chapter IV,

$$dv = r^2 \sin\theta\, dr\, d\theta\, d\eta,$$

while the element of area on the surface of the sphere at r is

$$dA = r^2 \sin\theta\, d\theta\, d\eta.$$

Since $\bar{Z}(\eta) Z(\eta) = 1/(2\pi)$, it is evident that the probability of
occurrence of the electron is independent of the angle η.
Hence,

$$P = \bar{Y}_{lm} Y_{lm} = \frac{1}{2\pi} \{X_{lm}(\theta)\}^2$$

is the probability of occurrence of the electron per unit solid angle at
any given value of θ. It is evident that this interpretation is
equivalent to the statement in Chapter VI that P is a measure of the
probability of occurrence per unit area on the surface of a sphere described
about the origin of coordinates. That is, the magnitude of P at any
given value of θ is a measure of the electron density on the corresponding
zone (since P is independent of η).

FIG. 38. Angular distribution functions for different electronic states of
hydrogen atoms.

In Figs. 34 and 35 plots were shown of the function P in the cases
l = 3, m = 0 and l = 3, m = 3, respectively. Figure 38, taken from
the treatise by H. E. White, 13 shows similar plots of the probability
density distribution as a function of θ for different electronic states of the
hydrogen atom. In the case n = 1, l = 0, the function P is spherically
symmetrical, that is, P is independent of θ. But in the case n = 2,
l = 1, three states are possible, corresponding to m = 0, ±1. As
White describes the plots:

For these three states, P gives the charge distributions shown at the right and top
in Fig. 38. Each curve is shown plotted symmetrically on each side of the vertical
axis in order to represent a cross section of the three-dimensional plot.
Three-dimensional curves are obtained by rotating each figure about its vertical axis. It
should be pointed out that the electron is not confined to the shaded areas in each
figure. The magnitude of a straight line joining the center and any point on a given
curve is a measure of the electron's probability of being found in the direction of that
line.

These figures indicate that for all m = 0 states, with the exception of s-electrons,
the charge density is greatest in the direction of the poles, i.e., in the direction θ = 0
and π. The exponent of $\Phi_m$ being zero implies that there is no motion in the
η-coordinate and that the motion of the electron, i.e., the plane of the orbit, is in some
one meridian plane through the η-axis, all meridian planes being equally probable.

13 "Introduction to Atomic Spectra," p. 63. In this book the symbol $\Theta_{m,l}$ is
used for the function $P_l^m(\cos\theta)$, and the latter is used to designate the derivative

For m = ±l, P has its maximum in the direction of the equatorial
plane, while for 0 < |m| < l, P has maxima oriented in definite
directions.

These plots correspond very well with the deductions based on the
classical Bohr theory for the directions of orientation of the electronic
orbits. In a subsequent section it will be shown that, on the basis of
the S. theory, the total angular momentum of the electron in its orbit
is given by $M = \sqrt{l(l+1)}\; h/(2\pi)$ and that the component of angular
momentum about the z-axis (the axis through θ = 0 and π) is
$M_z = m\, h/(2\pi)$. This means that, for absolute values of m greater
than zero but less than l, the orbits are oriented with respect to the
z-axis.

Figure 38 shows the classical oriented orbits for each state, below the
corresponding plot of P. The orbits are drawn slightly tilted out of the
normal plane in order to show an orbit rather than a straight line. It
should also be added that in the plots "for states m = 0 the scale is
approximately l/(l + 1) times that of the other states having the same
value."

Let us now consider the radial function. In this case, the square of
the normalized function $\{S_{nl}(r)\}^2$ should evidently be interpreted as the
probability of occurrence of the electron per unit length at the point whose
distance from the nucleus is given by the value of r.

However, a much better physical interpretation of the behavior of
the electron in a hydrogen atom is obtained by use of the so-called
distribution function, designated by D, which is derived from S(r)
thus.

The volume of the spherical shell between the radii r and r + dr
is evidently given by

$$\frac{d}{dr}\left(\frac{4}{3}\pi r^3\right) dr = 4\pi r^2\, dr.$$

Hence,

$$4\pi r^2 \{S_{nl}(r)\}^2\, dr = D\, dr \tag{39}$$

is a measure of the probability of occurrence of the electron in a spherical
shell of thickness dr and of radius r.

Figure 39, taken from the treatise by H. E. White, 14 gives values of
D as functions of $r/a_0$ for different electronic states of the hydrogen
atom. It will be observed that in all cases D = 0 for r = 0. This
arises from the fact that the volume of the spherical shell of thickness
dr becomes infinitesimally small as r decreases to zero. Stated in other
terms, D = 0 at r = 0 because of the factor $r^2$.

FIG. 39. Radial distribution function (D) for different electronic states of hydrogen
atom, and Bohr orbits for comparison.

On the other hand, $\bar{\phi}\phi$ plotted as a function of r, or $S^2(r)$, may be
quite large at r = 0. This is illustrated by considering the case n = 1,
l = 0 (the normal state of the hydrogen atom). From equation (35)
it follows that

$$\bar{\phi}_{100}\phi_{100} = \frac{1}{\pi}\left(\frac{Z}{a_0}\right)^3 e^{-2Zr/a_0}.$$

That is, the probability density is a maximum at the origin and decreases
exponentially with increase in r. Thus for $r = a_0/2$ in hydrogen (Z = 1),
the value of $\bar{\phi}\phi$ is 1/e times (36.8 per cent) its value at r = 0.

14 White, op. cit., p. 68. Similar plots are given in Pauling and Wilson, "Quantum
Mechanics," p. 143.
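The function D for this state is given by

$$D = 4\pi r^2 \{S_{10}(r)\}^2 = 16\pi \left(\frac{Z}{a_0}\right)^3 r^2 e^{-2Zr/a_0}.$$

It has a maximum at a value $r = r_m$ which may be calculated as
follows:

$$\frac{dD}{dr} = 32\pi \left(\frac{Z}{a_0}\right)^3 r \left(1 - \frac{Zr}{a_0}\right) e^{-2Zr/a_0}.$$

For $r = r_m$, $dD/dr = 0$.

Hence,

$$r_m = \frac{a_0}{Z}.$$

For Z = 1,

$$r_m = a_0.$$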
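
The position of the maximum of D can also be located numerically. The sketch below is restricted, as an assumption made for brevity, to states with l = n - 1 (1s, 2p, 3d), for which the Laguerre factor is a constant and D is proportional to $r^{2n} e^{-2Zr/(n a_0)}$; r is in units of $a_0$, the normalizing constant is omitted, and the function names are illustrative.

```python
from math import exp


def d_unnormalized(n, r, Z=1):
    """r^2 S^2 up to a constant factor for l = n - 1, with r in units of a_0."""
    return r ** (2 * n) * exp(-2.0 * Z * r / n)


def argmax_d(n, Z=1, r_max=40.0, steps=40000):
    """Grid search for the radius at which the distribution function peaks."""
    h = r_max / steps
    best = max(range(1, steps), key=lambda i: d_unnormalized(n, i * h, Z))
    return best * h


# Peak positions for the 1s, 2p, and 3d states of hydrogen.
peaks = [argmax_d(n) for n in (1, 2, 3)]
```

Within the grid spacing the peaks fall at 1, 4, and 9 units of $a_0$, that is, at $n^2 a_0/Z$.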



Figure 40 15 shows the function $\phi_{100}$ for the hydrogen atom, at the
top, $\rho = \phi_{100}^2$ in the center, and D at the bottom, each plotted as a
function of r (in Ångström units). The distribution function exhibits a
maximum at exactly that value of r which was deduced by Bohr as the radius
of the circular electronic orbit in the normal state. On the basis of the
Bohr theory, it was also shown that the orbits for the 2p and 3d states
should be circular and of radii $4a_0$ and $9a_0$, respectively. A simple
calculation shows that these are exactly the values at which the
corresponding D-functions exhibit single maxima, as indicated by the
plots in Fig. 39.

FIG. 40. Plots of characteristic function ($\phi_{100}$), density per unit volume
($\rho$), and distribution function (D) for normal state of hydrogen atom.

Figure 41 gives a more graphical interpretation of this function for the
case n = 2, l = 0 (spectroscopic 2s state). "If we could have the electron
in a hydrogen atom replaced by an infinitesimal source of light, the net
effect of the fluctuation in the instantaneous location of the electron over
a period of time would result in an image which would be brightest at those
points where the probability of occurrence is greatest." 16

15 Pauling and Goudsmit, "The Structure of Line Spectra," p. 30.
16 S. Dushman, J. Chem. Education, 8, 1074 (1931).

Figure 42 17 gives a photograph of a three-dimensional representation
of the distribution function for n = 2, l = 1, m = 0 (spectroscopic
2p state). Most of the charge lies in the region indicated by the
boundaries, but actually the density decreases exponentially with r
in the space outside the boundaries.

FIG. 41. Illustration of probability of occurrence of electron for excited
state of hydrogen atom (n = 2, l = 0).

FIG. 42. The charge density distribution for n = 2, l = 1, m = 0.
The boundaries of the figure are too sharp, for the charge extends
throughout space, though most of the charge lies in the region indicated
by the figure.


7.7 Comparison between Deductions from Classical and Quantum
Mechanics. The difference between the wave mechanics point of
view and that of the Bohr theory is brought out by comparing this
conception of a probability density distribution for the electron with
that of an electron revolving about the nucleus in a definite elliptic orbit.
According to the older theory, which was discussed in Chapter IV, the
magnitude of the major axis of the ellipse is determined by the value of n,
while the ratio of minor axis to major axis is given by the value of
k/n, where k is designated the azimuthal quantum number. In the new
theory, the latter has to be replaced by $\sqrt{l(l+1)}$. With this
modification, the corresponding orbits, as deduced from the older theory,
are indicated on each of the plots of D in Fig. 39. The origin is taken
as one of the foci of each ellipse, so that the distances from the origin
along the axis of r to these curves give the maximum and minimum
distances of the electron from the nucleus according to the
Bohr-Sommerfeld theory.

17 H. C. Urey, J. Chem. Education, 8, 1114 (1931).

On the basis of classical mechanics the electron was confined in its 
motion to an orbit of definite dimensions, determined by the magnitudes
of $E_n$ and of k/n. Although the new theory replaces this conception
by that of a distribution function for the occurrence of the
electron as a function of r, there is this point of resemblance between 
the two theories, that the function D always exhibits an exponential de- 
crease with increase in r beyond the maximum value of the latter which is 
given by classical theory. 

This conclusion is readily deduced by means of the following argu- 
ment. 

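Let us consider the S. radial equation (8) which, when multiplied
through by r, becomes

$$r\frac{d^2 S}{dr^2} + 2\frac{dS}{dr} + \left[A + \frac{2B}{r} - \frac{l(l+1)}{r^2}\right] S r = 0. \tag{40}$$

Put R = rS, so that $R^2 = r^2 S^2 = D/4\pi$. Hence,

$$\frac{dR}{dr} = S + r\frac{dS}{dr},$$

$$\frac{d^2 R}{dr^2} = r\frac{d^2 S}{dr^2} + 2\frac{dS}{dr},$$

and equation (40) becomes

$$\frac{d^2 R}{dr^2} + \left[A + \frac{2B}{r} - \frac{l(l+1)}{r^2}\right] R = 0. \tag{41}$$

Substituting for A and B from (9), the last equation becomes

$$\frac{d^2 R}{dr^2} + \left[\frac{8\pi^2\mu E}{h^2} + \frac{8\pi^2\mu Z e^2}{h^2 r} - \frac{l(l+1)}{r^2}\right] R = 0. \tag{41a}$$

From equations (1) and (3c) it is readily seen that

$$\frac{8\pi^2\mu E}{h^2} = -\left(\frac{Z}{n a_0}\right)^2; \quad \text{and} \quad \frac{8\pi^2\mu Z e^2}{h^2 r} = \frac{2Z}{a_0 r}.$$

Then equation (41a) becomes

$$\frac{d^2 R}{dr^2} + \left[-\left(\frac{Z}{n a_0}\right)^2 + \frac{2Z}{a_0 r} - \frac{l(l+1)}{r^2}\right] R = 0. \tag{41b}$$

For very large values of r, the terms containing 1/r and $1/r^2$ become
negligible, and the differential equation assumes the form

$$\frac{d^2 R}{dr^2} - \left(\frac{Z}{n a_0}\right)^2 R = 0.$$

The solution of this equation which vanishes for $r \to \infty$ is

$$R = C e^{-Zr/(n a_0)},$$

which shows that $S = R r^{-1}$ must decrease exponentially with increase
in r.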

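The condition that the function R shall exhibit a point of inflection
is given by $d^2 R/dr^2 = 0$. Hence, either R = 0 (which is physically
untenable) or

$$\frac{l(l+1)}{r^2} - \frac{2Z}{a_0 r} + \left(\frac{Z}{n a_0}\right)^2 = 0.$$

Solving this quadratic equation in (1/r) we obtain the two roots

$$\frac{1}{r} = \frac{Z\left\{n \pm \sqrt{n^2 - l(l+1)}\right\}}{n\, l(l+1)\, a_0}.$$

Hence,

$$r = \frac{n\, l(l+1)\, a_0}{Z\left\{n \pm \sqrt{n^2 - l(l+1)}\right\}} = \frac{n a_0}{Z}\left\{n \mp \sqrt{n^2 - l(l+1)}\right\}. \tag{42}$$

For l = 0,

$$r = \frac{2 n^2 a_0}{Z} \quad \text{or} \quad r = 0.$$

For $l(l+1) = k^2$,

$$r_m = \frac{n a_0}{Z}\left\{n + \sqrt{n^2 - k^2}\right\}, \qquad r_0 = \frac{n a_0}{Z}\left\{n - \sqrt{n^2 - k^2}\right\},$$

where $r_m$ and $r_0$ are the maximum and minimum values, respectively,
of r. These values correspond to those determined by the
Bohr-Sommerfeld theory for orbits of total quantum number n and azimuthal
quantum number k.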

It is thus evident that the points of inflection on the distribution curves 
must occur at those values of r which represent the maximum values 
for the classical orbits. That is, the maximum value of the function D 
always occurs in the region inside the boundary deduced from classical 
theory. 
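
The correspondence between equation (42) and the classical limits of the orbit can be illustrated numerically. The sketch below compares the two roots of (42) with the perihelion and aphelion distances a(1 ∓ ε) of a Kepler ellipse; it assumes the standard Bohr-Sommerfeld values $a = n^2 a_0/Z$ for the semi-major axis and $\varepsilon^2 = 1 - k^2/n^2$ for the eccentricity, measures r in units of $a_0$, and uses illustrative function names.

```python
from math import sqrt


def inflection_radii(n, l, Z=1):
    """Both roots of equation (42), in units of a_0, smaller root first."""
    s = sqrt(n * n - l * (l + 1))
    return (n / Z) * (n - s), (n / Z) * (n + s)


def bohr_sommerfeld_limits(n, k_squared, Z=1):
    """Minimum and maximum distance a(1 - eps), a(1 + eps) for the ellipse."""
    a = n * n / Z                          # semi-major axis (assumed standard value)
    eps = sqrt(1.0 - k_squared / (n * n))  # eccentricity from minor/major = k/n
    return a * (1.0 - eps), a * (1.0 + eps)


# Example: the 3p state, with k^2 replaced by l(l + 1) = 2.
r_min, r_max = inflection_radii(3, 1)
```

With $k^2$ replaced by l(l + 1) the two functions return the same pair of radii, which is the algebraic content of the statement that the inflection points of R mark the classical boundary.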

There are some other distinct points of similarity in the conclusions 
derived by means of both the older and newer theories. Thus, as 
mentioned previously, for the 1s, 2p, and 3d states the values of D
show single maxima at those values of r which are the radii of the 
corresponding Bohr circular orbits. 

A still more striking similarity in the conclusions deduced from the 
Bohr-Sommerfeld theory with those derived from the calculation of the 
function D is obtained by comparing the average value of r as computed 
by each method. 

According to equation (4.98), the average value of r on the basis of
the Bohr theory is given by

$$\bar{r} = \frac{1}{\tau}\int_0^\tau r\, dt = \frac{a_0}{2Z}\left\{3n^2 - k^2\right\}. \tag{43}$$

For the quantum mechanics case, in accordance with equation (5.30),

$$\bar{r} = \frac{\int_0^\infty r D\, dr}{\int_0^\infty D\, dr}, \tag{44}$$

where D, as defined by equation (39), is the probability of occurrence
of the electron in a spherical shell of thickness dr at the distance r from the
nucleus.

For the normal state of the hydrogen-like atom (n = 1, l = 0),
if we use normalized functions, the denominator in (44) is equal to
unity. Hence,

$$\bar{r} = 4\pi \int_0^\infty r^3 \phi_{100}^2\, dr = 4\left(\frac{Z}{a_0}\right)^3 \int_0^\infty r^3 e^{-2Zr/a_0}\, dr = \frac{a_0}{4Z}\int_0^\infty \rho^3 e^{-\rho}\, d\rho,$$

where $\rho = 2Zr/a_0$.

The value of the integral in the last equation is given in tables of
definite integrals (see Appendix III) as 3!. Consequently,

$$\bar{r} = \frac{3 a_0}{2Z}, \tag{45}$$

which is the same as that derived from equation (43) for $k^2 = 0$.

More generally the average value of any power s of r, for any state
of the hydrogen-like atom, is given by the relation

$$\overline{r^s} = \int_0^\infty r^s\, 4\pi r^2\, \bar{\phi}\phi(r)\, dr. \tag{46}$$

The method of evaluating the integral in this equation has been
discussed by I. Waller, 18 and in the case of s = 1, the relation derived
in this manner has the form 19

$$\bar{r} = \frac{a_0}{2Z}\left\{3n^2 - l(l+1)\right\}. \tag{47}$$

It will be observed that this relation is obtained from equation (43) by
substituting l(l + 1) for the square of the azimuthal quantum number k
which was used in the Bohr theory.
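
Equation (47) can be checked against a direct numerical average. The sketch below is an illustration under stated assumptions: the radial functions are taken with the normalization of equation (32), so that the integral of $S^2 r^2$ is unity, the average is computed as the integral of $r^3 S^2$ by a midpoint rule on a finite interval, r is in units of $a_0$, and the function names are hypothetical.

```python
from math import exp, factorial


def assoc_laguerre(k, p):
    """Coefficients (lowest power first) of L_k^p(x) = d^p/dx^p L_k(x)."""
    coeffs = [(-1) ** v * factorial(k) ** 2 // (factorial(k - v) * factorial(v) ** 2)
              for v in range(k + 1)]
    for _ in range(p):
        coeffs = [i * c for i, c in enumerate(coeffs)][1:]
    return coeffs


def radial_squared(n, l, r, Z=1):
    """{S_nl(r)}^2 from equation (32), with r in units of a_0."""
    rho = 2.0 * Z * r / n
    norm_sq = ((2.0 * Z / n) ** 3 * factorial(n - l - 1)
               / (2.0 * n * factorial(n + l) ** 3))
    poly = sum(c * rho ** i for i, c in enumerate(assoc_laguerre(n + l, 2 * l + 1)))
    return norm_sq * exp(-rho) * rho ** (2 * l) * poly ** 2


def mean_r_numeric(n, l, Z=1, r_max=80.0, steps=20000):
    """Midpoint-rule estimate of the integral of r^3 S^2 dr."""
    h = r_max / steps
    return sum(((i + 0.5) * h) ** 3 * radial_squared(n, l, (i + 0.5) * h, Z)
               for i in range(steps)) * h


def mean_r_formula(n, l, Z=1):
    """Equation (47) in units of a_0."""
    return (3 * n * n - l * (l + 1)) / (2.0 * Z)


STATES = [(1, 0), (2, 0), (2, 1), (3, 0), (3, 1), (3, 2)]
```

For the 1s state both routes give 1.5 $a_0$, the value of equation (45); for the other states in `STATES` the numerical average should agree with `mean_r_formula` to within the quadrature error.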

7.8 Relation between Kinetic and Potential Energy Deduced from
S. equation. 20 For the classical case, the average potential energy is
given, as shown in Chapter IV, by the relation

$$\bar{V} = -\frac{Z e^2}{a_n}, \tag{48}$$

and, since E is a constant, the average kinetic energy must be given by
the relation

$$\bar{T} = E - \bar{V} = \frac{Z e^2}{2 a_n}. \tag{49}$$

18 Z. Physik, 38, 635 (1926).

19 White, "Introduction to Atomic Spectra," p. 69; Pauling and Wilson,
"Introduction to Quantum Mechanics," p. 144.

20 The remarks in this section are based on a discussion of this topic in lecture notes
by Dr. F. Seitz.

This relation between the two forms of energy is regarded as a balance
between the so-called centripetal force of attraction between electron
and nucleus and the centrifugal force due to the orbital motions of the
two particles. Classical mechanics does not provide any criterion for
the determination of stationary states; this is supplied in the Bohr
model by the Wilson-Sommerfeld quantum conditions.

In quantum mechanics, on the other hand, we do not associate
position with kinetic energy, and the "picture" which we obtain of the
atom is that of the "electron cloud" for which the charge distribution
is derived from knowledge of the wave function $\phi_n$ associated with the
particular eigenvalue $E_n$ which characterizes any given state of the
system. What inferences can we draw from this concept regarding
the average values of potential and kinetic energy?

The average value of V is given by

$$\bar{V} = \int \bar{\phi}_n V \phi_n\, d\tau = -Z e^2 \int \bar{\phi}_n\, r^{-1} \phi_n\, d\tau. \tag{50}$$

Let us consider the case n = 1, l = 0, for which $\phi_{100}$ is given by
equation (35).

$$\int \bar{\phi}_{100}\, r^{-1} \phi_{100}\, d\tau = 4\left(\frac{Z}{a_0}\right)^3 \int_0^\infty r\, e^{-2Zr/a_0}\, dr = \frac{Z}{a_0}\int_0^\infty \rho\, e^{-\rho}\, d\rho = \frac{Z}{a_0}. \tag{51}$$

Hence,

$$\bar{V} = -\frac{Z^2 e^2}{a_0} = -\frac{Z e^2}{a_1},$$

where $a_1$ is the radius of the corresponding Bohr orbit. (See equations
(36) and (3c).)
Similarly it may be shown 21 that for any other value of n

$$\overline{\left(\frac{1}{r}\right)} = \frac{Z}{n^2 a_0} = \frac{1}{a_n},$$

and

$$\bar{V} = -\frac{Z e^2}{a_n} = 2E,$$

while

$$\bar{T} = \frac{Z e^2}{2 a_n} = -E.$$

21 H. Bethe, "Handbuch der Physik," Vol. XXIV, Part I, pp. 282-286.

That is, the average value of 1/r and the average values of the
potential and kinetic energies, as derived on the basis of quantum
mechanics, are identical with those derived in the Bohr theory.

In ordinary electrostatic theory, the potential energy in the field of a
positive charge Ze (at the origin), due to a negative charge distribution
defined by the function $e\sigma(r, \theta, \eta)$, is given by

$$\bar{V} = -Z e^2 \int \frac{\sigma}{r}\, d\tau.$$

It is evident that equation (50) merely expresses the same relation,
in which σ is replaced by $|\phi_n|^2$.

It is also of interest to consider these conclusions, regarding the 
eigenfunctions and eigenvalues for the hydrogen-like atom, from an- 
other point of view which will be found of great assistance in dealing 
with other atomic and molecular systems. 

We can write the S. equation for any system in the form

$$H\phi \equiv -\frac{1}{a^2}\nabla^2\phi + V\phi = E\phi, \tag{52}$$

where H is designated the Hamiltonian operator, and $a^2 = 8\pi^2\mu/h^2$.
That is, if φ is an eigenfunction of the system, then the result of operating
on it with H should be equal to Eφ where E is a constant which is
known as the total energy.

Multiplying both sides of (52) by $\bar{\phi}$ and integrating over the
configuration space, we obtain the relation

$$\int \bar{\phi} H \phi\, d\tau = -\frac{1}{a^2}\int \bar{\phi}\,\nabla^2\phi\, d\tau + \int \bar{\phi} V \phi\, d\tau = E\int \bar{\phi}\phi\, d\tau.$$

Since V is a function of the coordinate variables, E is a constant, and
φ is assumed to be normalized, the last equation can be written in the
form

$$-\frac{1}{a^2}\int \bar{\phi}\,\nabla^2\phi\, d\tau + \int V \bar{\phi}\phi\, d\tau = E. \tag{53}$$

As shown already, the second integral corresponds to the average
potential energy, and therefore the first integral must correspond to the
average kinetic energy of the particles constituting the system. In a
subsequent chapter it will be shown that if the form of the function φ
is chosen in such a manner as to make E a minimum, then φ will be a
solution of the S. equation (52).

Now let us consider the factors which govern the actual values of each
of the integrals in equation (53). For the purpose of such discussion
it is convenient to assume that φ is a real function of the three
rectangular coordinates. However, the conclusions which we shall derive are
equally applicable to the case in which φ is a complex function of any
other set of curvilinear coordinates.

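In terms of rectangular coordinates, the left-hand integral in equation
(53), which we shall designate by $I_T$, has the form

$$I_T = -\frac{1}{a^2}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \phi\left(\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial y^2} + \frac{\partial^2\phi}{\partial z^2}\right) dx\, dy\, dz.$$

Now

$$\frac{\partial}{\partial x}\left(\phi\frac{\partial\phi}{\partial x}\right) = \left(\frac{\partial\phi}{\partial x}\right)^2 + \phi\frac{\partial^2\phi}{\partial x^2}.$$

Hence

$$\int_{-\infty}^{\infty} \phi\frac{\partial^2\phi}{\partial x^2}\, dx = \left[\phi\frac{\partial\phi}{\partial x}\right]_{-\infty}^{\infty} - \int_{-\infty}^{\infty}\left(\frac{\partial\phi}{\partial x}\right)^2 dx.$$

But in order that φ shall be a "sensible" solution of the S. equation,
this function must vanish at the limits $x = \pm\infty$, and the same
conditions must apply with respect to the y and z coordinates. Hence, we
obtain the relation

$$I_T = \frac{1}{a^2}\int\!\!\int\!\!\int \left\{\left(\frac{\partial\phi}{\partial x}\right)^2 + \left(\frac{\partial\phi}{\partial y}\right)^2 + \left(\frac{\partial\phi}{\partial z}\right)^2\right\} dx\, dy\, dz,$$

which shows that $I_T$ is positive. Now $\partial\phi/\partial x$, $\partial\phi/\partial y$ and $\partial\phi/\partial z$
correspond to the components of the gradient of the function φ with
respect to each of the axes of coordinates, where we define the gradient
$\partial\phi/\partial n$ as the rate of change in φ with distance measured along the
normal to the surface representing the function φ. (See Appendix IV.)
Hence, the last relation can be written in the form

$$I_T = \frac{1}{a^2}\int \left(\frac{\partial\phi}{\partial n}\right)^2 d\tau.$$

It is evident that in the case of a spherically symmetrical function,
such as that for the lowest state of the hydrogen atom, $\partial\phi/\partial n$ will be
identical with $d\phi/dr$.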

Since $I_T$ is positive and the term involving V in equation (53) is
negative for stable states of the system, it is evident that these two terms
tend to counterbalance each other. In order to minimize E we might
attempt to minimize $\bar{V}$ by locating the electron in the hydrogen-like
atom as near the nucleus as possible, since this would increase the
absolute value of $Ze^2/r$. But this will tend to make $\bar{T}$ very large because
of the increased value of $\partial\phi/\partial n$. That is, the function φ will change so
rapidly with decrease in r that $\partial\phi/\partial n$ will assume very large values and E
will not be as negative as possible.

On the other hand, if the form of φ is chosen in such a manner as to
make $\bar{T}$ small, this means small values of $\partial\phi/\partial n$ and a consequent
"spread" over a large range of values of r in the function φ and in
the corresponding charge distribution function $e\bar{\phi}\phi$. Such a function
will not, however, give a charge distribution which is localized in the
most satisfactory manner in the regions of negative V, with the result
that, again, E will not be as low as possible. The best wave function
will therefore have a form which is intermediate between these two
extremes, and as shown previously, the compromise actually obtained
is such that

$$\bar{T} = -\tfrac{1}{2}\bar{V}, \quad \text{and} \quad E = \bar{T} + \bar{V} = \tfrac{1}{2}\bar{V}.$$
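
The compromise between kinetic and potential energy can be made concrete with a one-parameter family of trial functions. The sketch below is an illustration that is not taken from the text: it assumes the normalized trial function $\phi = \sqrt{b^3/\pi}\, e^{-br}$ and atomic units ($h/2\pi = \mu = e = 1$, lengths in $a_0$), for which the averages are $\bar{T} = b^2/2$ and $\bar{V} = -Zb$. A large b concentrates the charge near the nucleus, a small b spreads it out.

```python
def mean_kinetic(b):
    """Average kinetic energy of the trial function exp(-b r), atomic units."""
    return 0.5 * b * b


def mean_potential(b, Z=1):
    """Average potential energy -Z <1/r> = -Z b for the same trial function."""
    return -Z * b


def total_energy(b, Z=1):
    """Sum of the two averages, the quantity to be minimized."""
    return mean_kinetic(b) + mean_potential(b, Z)


def best_exponent(Z=1, b_max=5.0, steps=5000):
    """Grid search for the exponent b that minimizes the total energy."""
    h = b_max / steps
    return min((i * h for i in range(1, steps + 1)),
               key=lambda b: total_energy(b, Z))


b_opt = best_exponent()
```

The minimum falls at b = Z, where the total energy is $-Z^2/2$ in these units and the kinetic energy equals minus one-half of the potential energy, as in the relation above. Larger or smaller values of b raise the energy, through $\bar{T}$ or through $\bar{V}$ respectively.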

We can draw the same conclusions regarding the necessity for a
compromise between $\bar{T}$ and $\bar{V}$ from the Principle of Indeterminacy. For this
purpose we shall write the expression for the average kinetic energy in
the form

$$\bar{T} = \frac{1}{2\mu}\; \text{Av. of } (p_x^2 + p_y^2 + p_z^2),$$

where $p_x$, $p_y$, $p_z$ are the components of momentum with respect to the
three coordinate axes, and we consider the mean value of the sum of the
squares of these components. From the relation

$$\Delta p\, \Delta q \ge h$$

it is seen that if we choose φ in such a manner as to make Δq very small
(that is, confine the electron to a limited region of space, as in the Bohr
theory) then Δp must become very large, and since the average value of
T for one coordinate is approximately $(\Delta p)^2/(2\mu)$, the kinetic energy
will also increase enormously. On the other hand, the potential energy
will vary as 1/Δq, that is, V will be lower (more negative), the smaller
Δq. The best compromise is obtained by choosing such a form for φ
that the charge will be spread over a fairly extensive range of values
of r, so that, as a consequence, the magnitude of either T or V will not
become excessively large.
These considerations will appear as specially important when we
come to deal with atomic systems involving more than one electron
and with molecular systems. In these and similar cases, the distribution
functions for the electrons must satisfy the condition that the total
energy of the system, as derived from an equation such as (53), shall
be a minimum.

7.9 Angular Momentum of Electron. In the last section of Chapter VI
it was deduced that the average value of $M^2$ may be computed
from the relation

$$\overline{M^2} = -\frac{h^2}{4\pi^2}\int \bar{\phi}\,\Omega\,\phi\,d\tau,$$

where $\Omega$ is an operator defined in terms of $\theta$ and $\eta$.

In order to determine whether the electron in state n, l, m possesses a
definite value of M, it is necessary to find whether the relation

$$M^2\phi_{nlm} = -\frac{h^2}{4\pi^2}\,\Omega\,\phi_{nlm}$$

has solutions of the form

$$M^2\phi_{nlm} = a\,\phi_{nlm},$$

where a is a constant. In that case the solution is $M^2 = a$.
For the electron in the hydrogen-like atom

$$\phi_{nlm} = R_{nl}(r)\,Y_{lm}(\theta, \eta).$$

Hence,

$$\Omega\,\phi_{nlm} = R_{nl}(r)\,\Omega\,Y_{lm}(\theta, \eta),$$

since $\Omega$ is an operator which does not involve r.
But from the solution of equation (5.14) it follows that

$$\Omega\,Y_{lm} = -l(l+1)\,Y_{lm}.$$

Consequently,

$$-\Omega\,\phi_{nlm} = l(l+1)\,\phi_{nlm}.$$

Hence we conclude that the solution is

$$M^2 = \frac{h^2\,l(l+1)}{4\pi^2}. \qquad (55)$$

That is, in the state of quantum number l, the total angular momentum
is $\sqrt{l(l+1)}$ in units of $h/2\pi$. In a similar manner it may be deduced,
as in the case of the rigid rotator, that the angular momentum with
respect to the z-axis is

$$M_z = \frac{mh}{2\pi}.$$

The most complete treatment of the hydrogen atom system is given in Pauling
and Wilson's "Quantum Mechanics," Chapter V, and by H. Bethe, in "Handbuch
der Physik," XXIV, Part I, pp. 274-289. The latter discusses the Laguerre functions
very fully.

In connection with section 9, the readers should consult the discussion by Slater
and Frank, "Theoretical Physics," Chapter XXIII, and also the remarks by H. E.
White, "Introduction to Atomic Spectra," Chapter IV.



CHAPTER VIII 
VAN DER WAALS FORCES 

8.1 Van der Waals' Equation. An ideal or "perfect" gas obeys
the equation of state

$$PV = nRT, \qquad \text{(i)}$$

where P is the pressure; V, the volume; n, the number of moles; T, the
absolute temperature; and R, the gas "constant." For P in atmospheres
and V in cubic centimeters, R = 82.05 (cm.$^3$ atmos. deg.$^{-1}$ mole$^{-1}$).
This equation is valid for low pressures and comparatively high
temperatures, where forces of attraction and repulsion between molecules
in the gas have a negligible effect. But at higher pressures or low
temperatures, especially near those of condensation to the liquid state
and at pressures of the order of an atmosphere or higher, it has been
found that equation (i) is not in agreement with the observed data.
A very large number of empirical or semi-theoretical equations of state
have been suggested, but that postulated by van der Waals is the best
known and has been accepted most generally. This equation has the
form

$$\left(P + \frac{a}{V_m^2}\right)(V_m - b) = RT, \qquad \text{(ii)}$$

where $V_m$ is the volume per mole, and a and b are constants whose
values are determined from actual observations on the relation between
P, $V_m$, and T over a range of pressures and temperatures.
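
To make the departure from equation (i) concrete, the two equations of state may be compared numerically. This is a minimal sketch: R is the value quoted in the text, while the constants `A_EXAMPLE` and `B_EXAMPLE` are illustrative round numbers of the order of magnitude commonly tabulated for carbon dioxide, assumed here and not taken from the text.

```python
R = 82.05  # cm^3 atm deg^-1 mole^-1, as quoted in the text


def ideal_pressure(v_m, t):
    """Pressure (atm) of one mole of ideal gas, equation (i) with n = 1."""
    return R * t / v_m


def vdw_pressure(v_m, t, a, b):
    """Pressure (atm) from van der Waals' equation (ii), solved for P."""
    if v_m <= b:
        raise ValueError("molar volume must exceed the constant b")
    return R * t / (v_m - b) - a / v_m ** 2


# Illustrative constants only (assumed, roughly those of carbon dioxide).
A_EXAMPLE = 3.6e6  # atm cm^6 mole^-2
B_EXAMPLE = 43.0   # cm^3 mole^-1

dilute = (ideal_pressure(22400.0, 300.0),
          vdw_pressure(22400.0, 300.0, A_EXAMPLE, B_EXAMPLE))
dense = (ideal_pressure(500.0, 300.0),
         vdw_pressure(500.0, 300.0, A_EXAMPLE, B_EXAMPLE))
```

At a molar volume of 22,400 cm.$^3$ the two pressures differ by less than one per cent, whereas at 500 cm.$^3$ the van der Waals pressure falls well below the ideal value, as the discussion above leads one to expect.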
In this equation the constant b is given by the relation

$$b = 4Nv,$$

where N = Avogadro's number = 6.064 × 10$^{23}$ molecules per mole;¹
v = volume of a molecule = $(\pi/6)\,r^3$, where r = distance between
centers of molecules = molecular diameter.

¹ Values of physical constants used by the writer are those given by R. T. Birge,
Phys. Rev. Suppl., 1, 1 (1929); also "Smithsonian Physical Tables," edited by F. E.
Fowle, 1933, pp. 73-107. Also see Appendix II.

The constant a is derived from the following considerations. The net
force acting between two molecules is the resultant of a force of attraction
which varies with the distance according to a law of the form

$$f_a = -\frac{A}{r^m},$$

and a force of repulsion, which may be written in the form

$$f_r = \frac{B}{r^n},$$

where A, B, m, and n are constants for any given pair of molecules.
Hence the total force acting between the two molecules is

$$f = -\frac{A}{r^m} + \frac{B}{r^n}.$$

The mutual potential energy U(r) is given by

$$U(r) = \int_r^{\infty} f\,dr = -\frac{A}{(m-1)\,r^{m-1}} + \frac{B}{(n-1)\,r^{n-1}}. \qquad \text{(iv)}$$

If the molecules (or atoms) condense to form a solid lattice, they will
assume positions of equilibrium such that f = 0, and consequently
dU/dr = 0, that is, the potential energy will be a minimum for the
position of equilibrium. Let $r_0$ be the value of r for this condition. Then

$$\frac{A}{r_0^m} = \frac{B}{r_0^n},$$

and the mutual potential energy becomes

$$U(r_0) = -\frac{A\,(n-m)}{(m-1)(n-1)\,r_0^{m-1}}.$$

Since U is negative, it follows that n must be greater than m. An
illustration of such potential energy plots is given at the end of this
chapter (Fig. 51).
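
The equilibrium condition and the value of the potential energy at equilibrium can be checked numerically. The sketch below assumes the sign convention $f = -dU/dr$ with attraction counted as negative, i.e., $f = -A/r^m + B/r^n$ and $U(r) = -A/[(m-1)r^{m-1}] + B/[(n-1)r^{n-1}]$; the function names and the sample constants are hypothetical.

```python
def force(r, A, B, m, n):
    """Net force: attraction -A/r^m plus repulsion +B/r^n."""
    return -A / r ** m + B / r ** n


def potential(r, A, B, m, n):
    """Mutual potential energy, taken as zero at infinite separation."""
    return -A / ((m - 1) * r ** (m - 1)) + B / ((n - 1) * r ** (n - 1))


def equilibrium_distance(A, B, m, n):
    """f = 0 requires A / r0^m = B / r0^n, i.e. r0 = (B/A)^(1/(n-m))."""
    return (B / A) ** (1.0 / (n - m))


def potential_at_equilibrium(A, B, m, n):
    """Closed form U(r0) = -A (n - m) / ((m-1)(n-1) r0^(m-1))."""
    r0 = equilibrium_distance(A, B, m, n)
    return -A * (n - m) / ((m - 1) * (n - 1) * r0 ** (m - 1))


# Hypothetical constants chosen only to exercise the formulas.
A, B, m, n = 2.0, 3.0, 7, 13
r0 = equilibrium_distance(A, B, m, n)
```

For n greater than m the closed form is negative and coincides with the minimum of U(r), in agreement with the remark above.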

In terms of U(r), as determined by means of equation (iv), the
constant a is given by the relation²

$$a = -\frac{2\pi N^2}{1.013 \times 10^6}\int_{r_0}^{\infty} U(r)\,r^2\,dr, \qquad \text{(vii)}$$

where a is in atm. cm.$^6$ mole$^{-2}$, and 1 atmosphere = 1.0132 × 10$^6$
dynes/cm.$^2$

² F. London, Z. Physik, 63, 245 (1930). See also references to collateral reading
at the end of the chapter.

It should be added that according to van der Waals

$$b = \frac{RT_c}{8P_c}, \qquad a = \frac{27}{64}\,\frac{R^2T_c^2}{P_c}, \qquad \text{(viii)}$$

where $P_c$ and $T_c$ are the critical pressure and temperature, respectively.
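
The van der Waals relations between the constants and the critical data, $b = RT_c/(8P_c)$ and $a = 27R^2T_c^2/(64P_c)$, are easily evaluated. In the sketch below the critical data for carbon dioxide (about 304.2° K and 72.9 atm.) are approximate handbook figures assumed for illustration; they do not appear in the text.

```python
R = 82.05  # cm^3 atm deg^-1 mole^-1, as quoted in the text


def vdw_constants_from_critical(t_c, p_c):
    """Return (a, b) from the critical temperature (deg K) and pressure (atm)."""
    b = R * t_c / (8.0 * p_c)                     # cm^3 mole^-1
    a = 27.0 * R ** 2 * t_c ** 2 / (64.0 * p_c)   # atm cm^6 mole^-2
    return a, b


# Approximate critical data for carbon dioxide (assumed values).
a_co2, b_co2 = vdw_constants_from_critical(304.2, 72.9)
```

The two relations are mutually consistent with the familiar property of the van der Waals equation that $P_c = a/(27b^2)$, which serves as a check on the arithmetic.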

The problem regarding the origin of these attractive forces between
molecules engaged the attention of many investigators and various
explanations were suggested. Each of these hypotheses was, however,
found to be unsatisfactory, and it was only on the basis of wave
mechanics that a satisfactory theory was ultimately developed by F. London³
in 1930. According to this point of view, the van der Waals
forces of attraction are shown to be due to a dynamical polarization of
each molecule in the electric field due to its neighbor. The motion of
the electrons in one molecule or atom modifies that of the electrons in
the other molecule or atom in such a manner that the electrons tend on
the average to move in phase. The forces that arise are thus similar
in nature to those between dipoles. (A dipole is a system consisting
of a positive and negative charge located at a definite distance from
each other. If e denotes the magnitude of the charge at each "pole,"
and l, the distance between the point-charges, the dipole moment is
given by $\mu_e = el$.) Only, in the case of attraction between atoms or
molecules, the dipole moments fluctuate between maximum and minimum
values and the relative orientations vary continuously.

The theory of London and similar considerations by J. C. Slater and
J. G. Kirkwood⁴ lead to the conclusion that the van der Waals attractive
force between two molecules varies inversely as the seventh power
of the distance. On this basis it has been found possible to calculate
from the electron configurations of monatomic molecules (e.g., the rare
gases) values of the constant a which are in satisfactory agreement
with the values deduced from the equation of state or from critical
data.

In order to understand better the point of view and the arguments
developed by London, we shall consider first a phenomenon in classical
mechanics which presents a certain similarity, from the mathematical
point of view, to the interaction of two atoms as postulated by the
wave mechanics theory.

³ F. London, loc. cit., also Z. physik. Chem., B11, 222 (1931). It should be
mentioned that S. C. Wang, Physik. Z., 28, 663 (1927), first suggested the theory of
dipole interaction, but his method of calculation led to results which have not been
confirmed by subsequent investigators.

⁴ Slater and Kirkwood, Phys. Rev., 37, 682 (1931).

8.2 Two Interacting Pendulums. Whenever we have two vibrating 
systems which are coupled, however weakly, it is possible for an inter- 
change of energy to occur between them. The simplest illustration of 
this phenomenon is the behavior of two identical pendulums mounted, 
adjacent to each other, on a support which permits the motion of one 
pendulum to be transmitted to the other. The following description 
of the behavior of the pendulums is taken from the discussion by 
C. G. Darwin.⁵

We set the pendulum A in motion, while B is at rest. If the support were quite 
rigid A would go on swinging, and B would stay at rest. But there is a small effect 
on B through the give of the support, and consequently B begins to move. What 
happens is rather striking, for B starts swinging more and more and at the same 
time A's motion diminishes until B is swinging to the full amount while A has come
to rest. Afterwards the motion is transferred back to A again, and it continues 
passing from one to the other until the motion has died away altogether. 

The reason for this behavior is easily understood. The system of the two
pendulums, like every other vibrating system, has normal modes of vibration, but
these do not consist one in the motion of A and the other in the motion of B. The
modes are shown in Fig. 43. In one of them A and B are swinging equally both
to the left at the same time, and in the other they are also swinging equally, but
now when one is to the left the other is to the right. To distinguish the two modes
I shall call them by names of which the application is not very obvious for the
pendulums, but which are convenient because they are used in the quantum theory.
The first is the symmetric mode, the second the antisymmetric. The two modes
have frequencies of vibration which differ from one another a little, because the
effect of the yielding support is different in the two cases. The motion we described
before in which B and A alternately come to rest is the superposition of these two
vibrations. At one time the phases of the two modes are such that A is to the left
at the same time for both. Then B would be to the left for one and to the right
for the other and so by the principle of superposition it is at rest. Later, on account
of the slight difference of the frequencies of the modes, it will occur that the two
modes are in opposite phase for A, and therefore in the same phase for B, and
consequently the motion will now have passed entirely to B. The exchange of the motions
is illustrated in Fig. 44. It is perhaps well to notice that in likening the exchange
of motion of the pendulums to the beats of the two piano wires, the analogy is not
between each wire and each pendulum, but between each wire and each mode of the
two pendulums. It is the two modes that beat together.

FIG. 43. Illustrating the two modes of vibration of two interacting pendulums.

FIG. 44. Motion of each pendulum as a superposition of the two modes.

⁵ Darwin, "The New Conceptions of Matter," Chapter VIII.

The relation between the frequencies of the two modes and that of
each pendulum when not affected by the presence of the other and the
character of the motion of each pendulum when interaction occurs are
described in the following remarks by N. F. Mott.⁶

The frequencies of the system in these two normal modes are not equal to one
another, and neither is equal to the frequency with which either pendulum would
oscillate, were the other one absent. Let us call this latter frequency $\nu$ and the
frequency with which the pendulums vibrate in phase $\nu_1$, and with opposite phase
$\nu_2$. One can see that $\nu_2$ is greater than $\nu$ and $\nu_1$ less than $\nu$. For when they are
swinging in opposite phase, the pendulums are, so to speak, pulling one another back
all the time, and so increasing the restoring couple. The frequency must therefore
be greater than it would be in the absence of one of the pendulums. When the
pendulums are swinging in the same phase, the opposite is the case; the pendulums
help each other to swing outwards, the restoring couple is decreased, and so the
frequency is less than it would otherwise be. One can also see that the amount by
which $\nu_1$ and $\nu_2$ differ from $\nu$ depends on the strength of the coupling between the
pendulums. For instance, if the string on which the pendulums are hung be fairly
tight, and the two pendulums are hung very near to opposite ends of the string, then
$\nu_1$ and $\nu_2$ will not differ from $\nu$ by as much as they would if the pendulums were hung
nearer to the middle of the string.

It is on the existence of these two different frequencies $\nu_1$ and $\nu_2$ that the slow
exchange of energy from one pendulum to the other and back depends. If any
system is capable of vibrating in a certain number of normal modes, then the most
general vibration possible is obtained by superimposing these normal modes, one
upon the other. In our case, if $\theta_1$, $\theta_2$ are the angles through which the two
pendulums have swung at any moment, then the equations giving $\theta_1$ and $\theta_2$ for the first
normal mode (when the two vibrate in phase) are

$$\theta_1 = A\cos 2\pi(\nu_1 t + \alpha), \qquad \theta_2 = A\cos 2\pi(\nu_1 t + \alpha).$$

In the second normal mode (in which the pendulums swing in opposite phase)

$$\theta_1 = B\cos 2\pi(\nu_2 t + \beta), \qquad \theta_2 = -B\cos 2\pi(\nu_2 t + \beta).$$

A and B are arbitrary amplitudes, $\alpha$, $\beta$ are arbitrary phases. The most general
possible vibration, obtained by adding together these two, is

$$\left.\begin{aligned}\theta_1 &= A\cos 2\pi(\nu_1 t + \alpha) + B\cos 2\pi(\nu_2 t + \beta)\\ \theta_2 &= A\cos 2\pi(\nu_1 t + \alpha) - B\cos 2\pi(\nu_2 t + \beta)\end{aligned}\right\} \qquad \text{(ix)}$$

The angular velocities of both pendulums can be found by differentiating these
equations with respect to t. Now suppose we start the system swinging at time
t = 0, with one pendulum at rest in its position of equilibrium, and the other pulled
aside through an angle C. Then at time t = 0, say, we must have

$$\theta_1 = \dot{\theta}_1 = 0, \qquad \theta_2 = C, \qquad \dot{\theta}_2 = 0.$$

The constants A, B, $\alpha$, $\beta$ are therefore determined. We find that

$$A = -B = \tfrac{1}{2}C, \qquad \alpha = \beta = 0.$$

Writing (ix) in a slightly different form, then, we have for $\theta_1$, at any time t,

$$\theta_1 = -C\,\sin 2\pi\!\left(\frac{\nu_1 + \nu_2}{2}\right)t\;\sin 2\pi\!\left(\frac{\nu_1 - \nu_2}{2}\right)t, \qquad \text{(x)}$$

and for $\theta_2$

$$\theta_2 = C\,\cos 2\pi\!\left(\frac{\nu_1 + \nu_2}{2}\right)t\;\cos 2\pi\!\left(\frac{\nu_1 - \nu_2}{2}\right)t. \qquad \text{(xi)}$$

Now, $\nu_1$ and $\nu_2$ are only slightly different from $\nu$ and from each other. Therefore
the second factor in (x) and (xi) has a very much longer period than the first. Each
of the expressions (x) and (xi) then represents a vibration with frequency very nearly
equal to $\nu$, and with varying amplitude

$$C\sin 2\pi\!\left(\frac{\nu_1 - \nu_2}{2}\right)t \quad \text{or} \quad C\cos 2\pi\!\left(\frac{\nu_1 - \nu_2}{2}\right)t.$$

While the amplitude of the one increases, the amplitude of the other decreases. In
fact, a measurement of the time that it takes for the energy to go over from one
pendulum to the other and to come back again amounts to a measurement of $\nu_1 - \nu_2$.
If the coupling between the two pendulums is strong, this time will be shorter than
if it were weak.

⁶ Mott, "An Outline of Wave Mechanics," Chapter VI.




FIG. 45. Plot of the function $y = \sin 2\pi\nu t\,\sin 2\pi(\nu t/9)$.

It is interesting to realize the interpretation of the expressions for
$\theta_1$ and $\theta_2$. Figure 45 gives an illustration of such a vibration in which

$$\theta_1 = y = \sin 2\pi\nu t\;\sin\frac{2\pi\nu t}{m},$$

where $\nu/m = (\nu_1 - \nu_2)/2$, $\nu = (\nu_1 + \nu_2)/2$, and m = 9. That is,
the period ($\tau = 1/\nu$) of the vibration of frequency ($\nu_1 - \nu_2$) is nine
times that of the frequency ($\nu_1 + \nu_2$). The expression for $\theta_2$ in equation
(xi) gives the same curve, but for t = 0, it has a maximum value
corresponding to the abscissa $\pi/2$, instead of the value 0 which is
obtained in the case of $\theta_1$ for t = 0.
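
The equivalence between the superposition of the two normal modes and the slowly modulated vibration can be verified numerically. The sketch below assumes the initial conditions of the quoted discussion (one pendulum at rest, the other pulled aside through an angle C), so that the two mode amplitudes are C/2 and -C/2 with zero phases; the function names and the sample frequencies are ours.

```python
import math


def theta_modes(t, c, nu1, nu2):
    """theta_1 and theta_2 as a sum of the in-phase and opposite-phase modes."""
    sym = 0.5 * c * math.cos(2.0 * math.pi * nu1 * t)   # symmetric mode
    anti = 0.5 * c * math.cos(2.0 * math.pi * nu2 * t)  # antisymmetric mode
    return sym - anti, sym + anti


def theta_product(t, c, nu1, nu2):
    """The same motion written as a fast vibration times a slow envelope."""
    mean = 0.5 * (nu1 + nu2)
    half_diff = 0.5 * (nu1 - nu2)
    fast = 2.0 * math.pi * mean * t
    slow = 2.0 * math.pi * half_diff * t
    return -c * math.sin(fast) * math.sin(slow), c * math.cos(fast) * math.cos(slow)


def exchange_period(nu1, nu2):
    """Time for the energy to pass to the other pendulum and back again."""
    return 1.0 / abs(nu1 - nu2)
```

For example, with frequencies 0.95 and 1.05 (arbitrary units) the energy returns to the pendulum originally displaced after about 10 time units, and that pendulum is momentarily at rest at about 5 time units.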

8.3 Electron in Two Adjacent "Boxes." As mentioned in Chapter
III, an electron in a potential "box" resembles in its behavior
the electron in a hydrogen atom. The electron in the box possesses
a series of discrete energy states because it is only for these states
that stationary de Broglie waves are obtained by the reflection of the
wave motion at the walls. For an electron between barriers a distance
2a apart, the energy states are given by the relation



and according to equation (3.30), the eigenfunction for the lowest
state $E_1$ is

$$\phi_{\mathrm{I}} = 2A\cos\alpha x,$$

which determines the probability amplitude for the electron inside the
barriers, while outside these barriers the eigenfunction is

$$\phi_{\mathrm{II}} = \phi_{\mathrm{III}} = 2A\cos\alpha a\;e^{-\beta(x-a)}.$$

In these expressions,

$$\alpha^2 = \frac{8\pi^2\mu E}{h^2}, \qquad \beta^2 = \frac{8\pi^2\mu\,(U - E)}{h^2},$$

where $E = E_1$ (in this case) and U (> E) is the value of the potential
energy outside the barriers.

Now let us consider the behavior of an electron in a potential energy
field such as that shown in Fig. 46 (a), which corresponds to two potential
boxes (of atomic dimensions) adjacent to each other.⁷ Evidently, if
the compartments are very far apart (relative to the dimensions of the
compartments), the electron will remain for a very long period in the
particular region in which it is placed. But as the distance between
the boxes is decreased, there will be an increasing probability of
penetration from one region into the other, and for relatively small distances
(of the order of the de Broglie wave length) there is an equal probability
for the occurrence of the electron in either box, if observations are
carried out over a sufficiently long period.

This is due to the fact that the $\phi$-"pattern" for each energy state
extends over all the space outside the barriers, and there will be a
portion of the cosine curve in each compartment. In Fig. 46 (b) the curves
MFK and NGH show that each $\phi$-plot extends into the other box when
the two boxes are a distance 2b apart, where b is comparable with a;
the ordinates of the exponential portions at the edge of the other box
(BK or AH) are no longer vanishingly small. There is a possibility
of interaction, analogous to that observed in the case of the two
pendulums, and the two $\phi$-curves which were quite separate for the boxes at an
infinite distance will merge into one.

⁷ See non-mathematical discussion of this case by R. W. Gurney, "Elementary
Quantum Mechanics," pp. 44-48.

FIG. 46. Illustrating the two eigenfunctions for electron in two
adjacent "boxes."

When we consider the problem of joining these two curves by a curve
which shall be continuous and of finite amplitude at all points between
the two boxes, it is seen that there are two solutions possible. These
are shown in Fig. 46 (b) and (c), corresponding respectively to the
symmetrical and antisymmetrical modes of vibration or eigenfunctions.
In the former case, the $\phi$-curves for each compartment are joined by a
portion which corresponds to the hyperbolic cosine

$$\cosh\beta x = \tfrac{1}{2}\left(e^{\beta x} + e^{-\beta x}\right);$$

in the second, the intermediate portion corresponds to the hyperbolic sine

$$\sinh\beta x = \tfrac{1}{2}\left(e^{\beta x} - e^{-\beta x}\right).$$

(These functions have been discussed in Chapter II, and illustrated in
Fig. 6.)

Now let us consider first the symmetric case. The value of $\phi$ (for the
lowest energy state) at A is given, in the case of an electron in a single
compartment, by the ordinate $AF = 2A\cos\alpha a$, and the exponential
part FK by the relation

$$\phi_{\mathrm{II}} = 2A\cos\alpha a\;e^{-\beta(x-a)},$$

where the origin of abscissa is at C. Hence at x = 2b + a,

$$\phi_{\mathrm{II}} = 2A\cos\alpha a\;e^{-2\beta b},$$

which corresponds to the ordinate BK. Since the $\phi$-patterns are
symmetrical about O, BK = AH. The value of $\phi$ in the region AOB
will be given by the sum of the values which are due to the exponential
portions FK and GH.⁸ This is indicated by the curve F'RG', so that
F'A = FA + AH, and hence

$$F'A = G'B = 2A\cos\alpha a\,\left(e^{-2\beta b} + 1\right).$$

But if the function $\phi$ and its derivative $d\phi/dx$ are to be continuous
over the whole range of values of x, it is evidently necessary that the
ordinate of the cosine curve inside the box shall also be increased.
That is, the ordinate AF' must correspond to a value $\phi_{\mathrm{I}} = 2A\cos\alpha_1 a$,
where $\alpha_1$ is different from $\alpha$. In other words, the necessity for
continuity of the $\phi$-pattern leads to a change in the value of $\alpha^2 = 8\pi^2\mu E/h^2$,
and consequently to a change in the value of E for the energy state of the
electron. We thus derive a relation between $\alpha_1$ and $\alpha$, of the form

$$2A\cos\alpha_1 a = 2A\cos\alpha a\,\left(1 + e^{-2\beta b}\right),$$

that is,

$$\frac{\cos\alpha_1 a}{\cos\alpha a} = 1 + e^{-2\beta b}. \qquad (1)$$

Thus $\cos\alpha_1 a$ is slightly greater than $\cos\alpha a$, and therefore $\alpha_1$ must be
less than $\alpha$. ($\cos 0 = 1$, $\cos\pi/2 = 0$.) That is, the value $E_S$, for the
symmetric case, is less than E. Equation (1) may be expressed in a
more convenient form thus:

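⁸ This is an illustration of the application of the Principle of Superposition,
discussed in Chapter II.

Let $\alpha_1 = \alpha - \Delta\alpha$, where $\Delta\alpha$ denotes a very small change in $\alpha$. Then,

$$\frac{\cos(\alpha - \Delta\alpha)a}{\cos\alpha a} = \frac{\cos\alpha a\,\cos(a\Delta\alpha) + \sin\alpha a\,\sin(a\Delta\alpha)}{\cos\alpha a} = 1 + a\Delta\alpha\,\tan\alpha a = 1 + \frac{\beta}{\alpha}\,a\Delta\alpha, \qquad (2)$$

since $\cos(a\Delta\alpha) = 1$, approximately, and $\sin(a\Delta\alpha) = a\Delta\alpha$,
approximately, and according to equation (5.21), $\tan\alpha a = \beta/\alpha$. From
equations (1) and (2) it follows that

$$\Delta\alpha = \frac{\alpha}{a\beta}\,e^{-2\beta b}.$$

Since $2\alpha\,\Delta\alpha = 8\pi^2\mu\,\Delta E/h^2$, where $\Delta E$ = decrease in energy, it is evident
that the last equation may be replaced by the relation

$$\frac{\Delta E}{E} = \frac{\lambda}{\pi a}\,e^{-4\pi b/\lambda}, \qquad (4)$$

where

$$\lambda = \frac{h}{\sqrt{2\mu(U - E)}} = \frac{2\pi}{\beta}$$

represents a distance.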

Thus, suppose b = a and $\lambda$ = 4a = 4b. Then,

$$\frac{\Delta E}{E} = \frac{4}{\pi}\,e^{-\pi} = 0.055.$$

If 2a = 2 × 10$^{-8}$ cm., E = 9.15 electron volts, as shown in Chapter
III, and $\Delta E$ = 0.50 electron volt. Since $\lambda = 12.21 \times 10^{-8}/\sqrt{V}$ cm.,
where V is the kinetic energy in electron volts,⁹ the value $\lambda$ (= 4a =
4 × 10$^{-8}$ cm.) corresponds to a value for U - E of (12.21/4)$^2$ =
9.32 electron volts.
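
The arithmetic of this example may be checked as follows. The sketch assumes that the relative shift has the form $\Delta E/E = (\lambda/\pi a)\,e^{-4\pi b/\lambda}$ and that $\lambda = 12.21 \times 10^{-8}/\sqrt{V}$ cm. with V in electron volts; the function names are ours.

```python
import math


def relative_shift(a, b, lam):
    """Relative energy shift for boxes of half-width a separated by 2b."""
    return lam / (math.pi * a) * math.exp(-4.0 * math.pi * b / lam)


def energy_for_wavelength_ev(lam_cm):
    """Invert lambda = 12.21e-8 / sqrt(V) cm to obtain V in electron volts."""
    return (12.21e-8 / lam_cm) ** 2


ratio = relative_shift(a=1.0, b=1.0, lam=4.0)  # b = a, lambda = 4a
shift_ev = ratio * 9.15                        # E = 9.15 electron volts
barrier_ev = energy_for_wavelength_ev(4.0e-8)  # U - E for lambda = 4e-8 cm
```

The ratio comes out close to 0.055 and the shift close to 0.50 electron volt; the quantity (12.21/4)$^2$ evaluates to roughly 9.32.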

⁹ Since $\lambda = h/\sqrt{2\mu E}$, and $E = Ve/300$, where V is the potential difference
required in order that the electron shall acquire the given kinetic energy, it follows that

$$\lambda = \frac{6.55 \times 10^{-27}}{\sqrt{2 \times 9 \times 10^{-28} \times 4.77 \times 10^{-10}\,V/300}} = \frac{12.21 \times 10^{-8}}{\sqrt{V}}\ \text{cm.},$$

where e = 4.77 × 10$^{-10}$ electrostatic units;
$\mu$ = 9 × 10$^{-28}$ g.;
h = 6.55 × 10$^{-27}$ erg sec.;
1/300 = conversion factor from ordinary volts to electrostatic units of potential.

The antisymmetric case is treated quite similarly. The full curve
Fig. 46 (c) shows the hyperbolic sine portion F'OG'. In this case,
F'A = FA - FF' = FA - BK, so that we obtain the relation

$$\frac{\cos\alpha_2 a}{\cos\alpha a} = 1 - e^{-2\beta b}. \qquad (5)$$

Hence $\alpha_2$ must be greater than $\alpha$. That is, the value of $E_A$, the energy
of the electron for the antisymmetric case, is greater than E, and as in
the symmetric case it is seen that the increase in energy $\Delta E$ is given,
as before, by equation (4).

Thus, it is seen that the original ground level $E_1$ for the electron in
the potential box has become split into two by the presence of the other
box adjacent to it. These two levels are equally spaced about the original
level, and, as is evident from equation (4), the difference between each
level and the unaltered level is given by

$$\Delta E = \frac{\lambda E_1}{\pi a}\,e^{-4\pi b/\lambda}. \qquad (6)$$

It will be observed that the exponential factor represents the
probability of penetration of the electron through the barrier between the two
potential boxes. This is evident from the remarks in Chapter III,
especially those relating to equation (5.44). Thus for b very large, or
U - E infinitely great, the value of this probability becomes
infinitesimally small, and $\Delta E$ tends to vanish.

The bearing of these conclusions on the problem of the energy levels
for the ionized hydrogen molecule (H$_2^+$) is self-evident. The two
potential barriers are represented by the potential fields around the two
nuclei, and the single electron must fluctuate between these two coulomb
fields. The potential function ($-e^2/r$) is indicated in Fig. 47,¹⁰ by the
two sets of hyperbolas which are joined in the center. The energy of the
electron in the lowest state is given for the individual atom, as shown
in Chapter VII, by the relation

$$E = -\frac{2\pi^2\mu e^4}{h^2},$$

and the eigenfunction is given by

$$\phi = \frac{1}{\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2} e^{-r/a_0}.$$

FIG. 47. Illustrating the two eigenfunctions for electron in ionized hydrogen
molecule (H$_2^+$).

¹⁰ N. F. Mott, loc. cit., Fig. 10, p. 96.

But when the two nuclei are brought together within a distance
comparable with $a_0$, the probability of transition of the electron from one
nucleus to the other becomes appreciable, and consequently each energy
level becomes split up into two levels, very close together and only
slightly different, of which $E_S$ is less than E, that is more negative, and
$E_A$ lies above E, i.e., is more positive. Also corresponding to these
two energy levels into which each original level is split, there are two
separate modes of vibration, or eigenfunctions, which are indicated
in Fig. 47 by $\phi_1$ (corresponding to $E_S$) and $\phi_2$ (corresponding to $E_A$).

8.4 Coupled Linear Harmonic Oscillators. The behavior of an
electron in two neighboring potential boxes is represented, as discussed
in the previous sections, by two modes of vibration. Two identical
linear harmonic oscillators when coupled together exhibit a similar
behavior, and it was by investigating the nature of the interaction of
two such oscillators that F. London deduced his theory of the origin of
van der Waals forces. Before presenting this theory, however, it is
necessary to review briefly the concepts of electric moment and
polarizability of molecules.¹¹

An atom of argon or molecule of methane has a symmetrical
distribution of electrons about the nucleus or nuclei. That is, the center
of gravity of the electrons coincides with that of the positive charges.
Such molecules are said to be non-polar. If, however, these molecules
are placed in an electric field as, for instance, that produced between
the plates of a charged condenser, the positive and negative charges in
the molecule are attracted toward the oppositely charged plates, so that
the two centers of gravity no longer coincide. If z denote the separation
of the two sets of charges, the molecule now behaves like a dipole of
electric moment $\mu_e = ez$. The magnitude of z, and consequently that of
the induced moment, is proportional to the field F (in volts per
centimeter), in accordance with the relation

$$\mu_e = ez = \alpha F, \qquad (7)$$

and the constant $\alpha$ is designated the polarizability. The polarization
energy is therefore given by

$$U_p = -\int_0^{\mu_e} F\,d\mu_e = -\int_0^{F} \alpha F\,dF = -\frac{\alpha F^2}{2}, \qquad (8)$$

where the negative sign indicates that the energy is that of attraction.

¹¹ See references at the end of the chapter.

In order to illustrate London's theory in the simplest manner, let us
consider the interaction of two linear harmonic oscillators, arranged as shown
in Fig. 48, where R is the equilibrium distance between the positive ends of
the dipoles, and $u_1$ and $u_2$, representing the amplitudes of oscillation, are
each small compared to R.¹² We shall assume that the masses $\mu$ and
charge e are the same for both oscillators, as well as the restoring force
constant k.

FIG. 48. Interaction of two dipoles arranged in the same direction.

The mutual potential energy of the two oscillators is given, according
to Coulomb's law, by the expression

$$U = \frac{e^2}{R + u_2 - u_1} - \frac{e^2}{R + u_2} - \frac{e^2}{R - u_1} + \frac{e^2}{R}.$$

Since $u_1/R$ and $u_2/R$ are each considerably smaller than 1, we can
expand each of the terms thus:

$$\frac{1}{R + u_2 - u_1} = \frac{1}{R}\left[1 - \frac{u_2 - u_1}{R} + \frac{(u_2 - u_1)^2}{R^2} - \cdots\right],$$

and

$$\frac{1}{R + u_2} = \frac{1}{R}\left[1 - \frac{u_2}{R} + \frac{u_2^2}{R^2} - \cdots\right], \qquad \frac{1}{R - u_1} = \frac{1}{R}\left[1 + \frac{u_1}{R} + \frac{u_1^2}{R^2} + \cdots\right].$$

¹² This illustration is taken from the discussion by M. Born and M. Göppert-Mayer
in "Handbuch der Physik," Vol. XXIV, Part 2, p. 750.

Neglecting terms in these expansions involving $R^{-4}$ and more
negative powers of R, we obtain the relation

$$U = -\frac{2e^2 u_1 u_2}{R^3}. \qquad (9)$$

By breaking the series at this point we take into account only
dipole-dipole interaction. The following terms of the series would correspond
to dipole-quadrupole interaction (term involving $R^{-4}$) and
quadrupole-quadrupole interaction (term involving $R^{-6}$).
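
The adequacy of the dipole-dipole approximation can be tested by comparing it with the full Coulomb sum. The sketch below assumes the arrangement of Fig. 48 as we read it: the positive charges a distance R apart and each negative charge displaced by $u_1$ or $u_2$ in the same direction along the line of centers; the function names and the sample numbers are hypothetical.

```python
def coulomb_energy(R, u1, u2, e=1.0):
    """Sum of the four charge-charge terms for the two collinear dipoles."""
    return e ** 2 * (1.0 / R                # positive-positive
                     + 1.0 / (R + u2 - u1)  # negative-negative
                     - 1.0 / (R + u2)       # positive 1 with negative 2
                     - 1.0 / (R - u1))      # negative 1 with positive 2


def dipole_dipole_energy(R, u1, u2, e=1.0):
    """Leading term of the expansion: U = -2 e^2 u1 u2 / R^3."""
    return -2.0 * e ** 2 * u1 * u2 / R ** 3


def relative_error(R, u1, u2):
    exact = coulomb_energy(R, u1, u2)
    return abs(exact - dipole_dipole_energy(R, u1, u2)) / abs(exact)
```

For displacements of 0.5 and 0.3 and R = 100 (arbitrary units) the leading term reproduces the full sum to better than one per cent, and the agreement improves as R is increased.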

We can now write down the total energy of the system in the
Hamiltonian form (see Chapter IV)

$$E = \frac{1}{2\mu}\left(p_1^2 + p_2^2\right) + \frac{k}{2}\left(u_1^2 + u_2^2\right) - \frac{2e^2 u_1 u_2}{R^3}. \qquad (10)$$

For large values of R the last term vanishes, and we have two identical
linear oscillators, for which the frequency is given by the relation

$$\nu_0 = \frac{1}{2\pi}\sqrt{\frac{k}{\mu}}. \qquad (11)$$

When, however, R is decreased, the last term in equation (10) has
to be taken into account. The field due to each oscillator acts on the
other, and, in accordance with equation (8),

$$\tfrac{1}{2}ku^2 = \tfrac{1}{2}\alpha F^2, \qquad (12)$$

where F is the field and u the resultant displacement.
The resulting dipole moment is given by

$$\mu_e = ue = \alpha F. \qquad (13)$$

It follows from (12) and (13) that

$$\alpha = \frac{e^2}{k}. \qquad (14)$$

In the absence of the last term in equation (10) the total energy E is
the sum of the energies $E_1$ and $E_2$ of two simple oscillators. However,
the presence of the product term involving $u_1u_2$, which is designated
the perturbation potential energy term, shows that the simple additive
solution cannot be valid. Under these circumstances it is necessary in
general to resort to the perturbation theory, and in wave mechanics
there is available a similar mathematical technique for dealing with the
corresponding S. equation, which is discussed in the following chapter.
In equation (10) it is possible, however, to obtain an expression for E
by a much less tedious procedure. This is the method of transformation
to so-called normal coordinates ("Hauptachsen").

In equation (10) let us introduce the new coordinates $q_1$ and $q_2$ such
that

$$q_1 = \frac{1}{\sqrt{2}}\,(u_1 + u_2) \quad \text{and} \quad q_2 = \frac{1}{\sqrt{2}}\,(u_1 - u_2). \qquad (15)$$

Hence,

$$u_1 = \frac{1}{\sqrt{2}}\,(q_1 + q_2) \quad \text{and} \quad u_2 = \frac{1}{\sqrt{2}}\,(q_1 - q_2). \qquad (16)$$

It will be recognized that this transformation corresponds to a rotation
of the rectangular axes $u_1$ and $u_2$ through an angle of 45°. This
accounts for the similarity between the two sets of equations (15) and
(16).
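In terms of these new variables

$$p_1^2 + p_2^2 = \mu^2\left(\dot{q}_1^2 + \dot{q}_2^2\right),$$

so that $p_1$ and $p_2$ now refer to momenta associated with the new
coordinates.
Furthermore,

$$u_1^2 + u_2^2 = q_1^2 + q_2^2, \qquad 2u_1u_2 = q_1^2 - q_2^2.$$

Therefore equation (10) becomes

$$E = \left[\frac{p_1^2}{2\mu} + \frac{1}{2}\left(k - \frac{2e^2}{R^3}\right)q_1^2\right] + \left[\frac{p_2^2}{2\mu} + \frac{1}{2}\left(k + \frac{2e^2}{R^3}\right)q_2^2\right]. \qquad (17)$$

In this form, the energy is again separable into two terms,
corresponding to two new modes of vibration of which the frequencies $\nu_1$ and
$\nu_2$ are given by the relations

$$\nu_1 = \frac{1}{2\pi}\sqrt{\frac{1}{\mu}\left(k - \frac{2e^2}{R^3}\right)} = \nu_0\sqrt{1 - \frac{2e^2}{kR^3}}, \qquad \nu_2 = \frac{1}{2\pi}\sqrt{\frac{1}{\mu}\left(k + \frac{2e^2}{R^3}\right)} = \nu_0\sqrt{1 + \frac{2e^2}{kR^3}}. \qquad (18)$$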

It is also evident from the equations in (15) that $\nu_1$ and $\nu_2$
correspond to the symmetric and antisymmetric modes respectively.

For values of x < 1

$$\sqrt{1 + x} = 1 + \frac{x}{2} - \frac{x^2}{8} + \frac{x^3}{16} - \cdots$$

and

$$\sqrt{1 - x} = 1 - \frac{x}{2} - \frac{x^2}{8} - \frac{x^3}{16} - \cdots \qquad (19)$$

Since $2e^2/(kR^3)$ is small compared to unity, we can expand the
expressions under the radical signs in (18). The resulting relations are
given by

$$\nu_1 = \nu_0\left(1 - \frac{e^2}{kR^3} - \frac{e^4}{2k^2R^6} - \cdots\right)$$

and

$$\nu_2 = \nu_0\left(1 + \frac{e^2}{kR^3} - \frac{e^4}{2k^2R^6} + \cdots\right),$$

where $\nu_0$ is given by equation (11).

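For a single oscillator, in accordance with the result obtained in
Chapter V,

$$E_n = h\nu_0\left(n + \tfrac{1}{2}\right). \qquad (5.16)$$

Hence, in the present case

$$E = h\nu_1\left(n_1 + \tfrac{1}{2}\right) + h\nu_2\left(n_2 + \tfrac{1}{2}\right).$$

For $n_1 = n_2 = 0$,

$$E_0 = \tfrac{1}{2}h(\nu_1 + \nu_2) = h\nu_0\left(1 - \frac{e^4}{2k^2R^6}\right) = h\nu_0 - \frac{h\nu_0\alpha^2}{2R^6},$$

in consequence of (14).
Thus the coupling energy $\Delta E$ is given by the relation

$$\Delta E = -\frac{h\nu_0\alpha^2}{2R^6}. \qquad (20)$$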
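
The accuracy of the series result may be judged by comparing it with the exact zero-point energy of the two modes. In the sketch below the dimensionless quantity $x = 2e^2/(kR^3)$ is used as the single parameter, so that the approximate coupling energy $-h\nu_0\alpha^2/(2R^6)$, with $\alpha = e^2/k$, becomes $-h\nu_0x^2/8$; the function names and the units (h = 1) are our own choices.

```python
import math


def mode_frequencies(nu0, x):
    """Frequencies of the symmetric and antisymmetric modes for x = 2e^2/(k R^3)."""
    if not 0.0 <= x < 1.0:
        raise ValueError("x must lie between 0 and 1")
    return nu0 * math.sqrt(1.0 - x), nu0 * math.sqrt(1.0 + x)


def coupling_energy_exact(nu0, x, h=1.0):
    """Zero-point energy of the coupled pair minus that of the free pair."""
    nu1, nu2 = mode_frequencies(nu0, x)
    return 0.5 * h * (nu1 + nu2) - h * nu0


def coupling_energy_series(nu0, x, h=1.0):
    """Leading term: -h nu0 x^2 / 8, i.e. -h nu0 alpha^2 / (2 R^6)."""
    return -h * nu0 * x * x / 8.0
```

For x = 0.1 the two values agree to within one per cent, and both are negative, corresponding to an attraction which varies as $R^{-6}$ in the energy.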
Before treating the same problem from the point of view of the S.
equation, it is of interest to consider the relatively simpler case of a linear
harmonic oscillator acted on by a uniform electric field F.¹³

In absence of a field, as shown in Chapter V, the S. equation has
the form

$$\frac{d^2\phi}{dq^2} + a\left(E - \tfrac{1}{2}kq^2\right)\phi = 0, \qquad (21)$$

where $a = 8\pi^2\mu/h^2$.
The eigenfunction corresponding to the state n = 0 is

$$\phi_0(x) = \pi^{-1/4}\,e^{-x^2/2},$$

where $x = q\sqrt{b}$. Since

$$1 = \frac{1}{\sqrt{\pi}}\int_{-\infty}^{\infty} e^{-x^2}\,dx = \frac{1}{\sqrt{\pi}}\int_{-\infty}^{\infty} e^{-bq^2}\sqrt{b}\,dq,$$

therefore, $\phi_0$, as a function of the actual displacement q, is given by

$$\phi_0(q) = \left(\frac{b}{\pi}\right)^{1/4} e^{-bq^2/2}, \qquad (22)$$

where

$$b = \frac{2\pi\sqrt{k\mu}}{h} = \frac{4\pi^2\mu\nu_0}{h}.$$

If the oscillator is subject to a uniform electric field F the potential
energy term in (21) becomes

$$V = \tfrac{1}{2}kq^2 - eFq = \tfrac{1}{2}k\left(q - \frac{eF}{k}\right)^2 - \frac{e^2F^2}{2k}.$$

¹³ The following discussion is based upon the paper by J. E. Lennard-Jones, Proc.
Phys. Soc. (London), 43, 461 (1931).

Thus, if we let $z = q - eF/k$, the wave equation (21) becomes

$$\frac{d^2\phi}{dz^2} + \frac{8\pi^2\mu}{h^2}\left(E + \frac{e^2F^2}{2k} - \tfrac{1}{2}kz^2\right)\phi = 0,$$

which is of the same form as (21). The energy values, however, are
given by the relation

$$E_n = \left(n + \tfrac{1}{2}\right)h\nu_0 - \frac{e^2F^2}{2k}. \qquad (24)$$

That is, the value of $\nu_0$ is unaltered, and $\phi_0(z)$ is identical in form with
$\phi_0(q)$, but the whole wave pattern is displaced a distance eF/k in the
direction of the field. This is similar to the phenomenon which occurs
when a molecule or atom is polarized, and the polarization energy is given,
according to the last equation, by

$$U_p = -\frac{e^2F^2}{2k} = -\tfrac{1}{2}\alpha F^2,$$

where $\alpha = e^2/k$.
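
The step on which this result rests, the completion of the square in the potential energy, can be confirmed numerically. The sketch assumes the potential energy in the field to be $\tfrac{1}{2}kq^2 - eFq$ (the sign depends on the direction chosen for the field); the function names and sample constants are hypothetical.

```python
def potential_in_field(q, k, e, F):
    """Potential energy of the oscillator in a uniform field F."""
    return 0.5 * k * q * q - e * F * q


def shifted_form(q, k, e, F):
    """Same potential written about the displaced origin z = q - eF/k."""
    z = q - e * F / k
    return 0.5 * k * z * z - (e * F) ** 2 / (2.0 * k)


def polarization_energy(k, e, F):
    """U_p = -alpha F^2 / 2 with alpha = e^2 / k."""
    alpha = e * e / k
    return -0.5 * alpha * F * F


k, e, F = 3.0, 1.5, 0.4  # arbitrary sample constants
q_min = e * F / k        # displacement of the whole wave pattern
```

The two forms of the potential agree at every displacement, and the value of the potential at its displaced minimum is exactly the polarization energy.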

Let us now consider two identical linear oscillators, each vibrating
along the q-axis, with centers a distance R apart. There will be an
interaction due to the field produced by each oscillator in the region of the
other, and in accordance with equation (10), the corresponding S.
equation has the form

$$\frac{\partial^2\phi}{\partial q_1^2} + \frac{\partial^2\phi}{\partial q_2^2} + \frac{8\pi^2\mu}{h^2}\left(E - \tfrac{1}{2}kq_1^2 - \tfrac{1}{2}kq_2^2 + \frac{2e^2q_1q_2}{R^3}\right)\phi = 0, \qquad (26)$$

where $\phi$ is a function of $q_1$ and $q_2$ as well as of R.

This is a type of equation which occurs very often in quantum
mechanics; in fact it is typical of all cases in which there is an interaction
between two systems. We meet with it, in a modified form, in the
investigation of the helium atom and of the hydrogen molecule.

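Now if the perturbation energy term were not present, the solution of
the equation

$$\frac{\partial^2\phi}{\partial q_1^2} + \frac{\partial^2\phi}{\partial q_2^2} + a\left(E - \tfrac{1}{2}kq_1^2 - \tfrac{1}{2}kq_2^2\right)\phi = 0 \qquad (27)$$

would obviously be obtained by putting

$$\phi = \phi(q_1)\,\phi(q_2).$$

For, if we substitute the latter in (27) we obtain the equation

$$\frac{1}{\phi(q_1)}\left[\frac{d^2\phi(q_1)}{dq_1^2} + a\left(E_1 - \tfrac{1}{2}kq_1^2\right)\phi(q_1)\right] + \frac{1}{\phi(q_2)}\left[\frac{d^2\phi(q_2)}{dq_2^2} + a\left(E_2 - \tfrac{1}{2}kq_2^2\right)\phi(q_2)\right] = 0.$$

Since $\phi(q_1)$ is a function of $q_1$ only, and $\phi(q_2)$ of $q_2$ only, it follows that
each of the expressions in large brackets is equal to zero. That is, we
obtain two ordinary differential equations which are each similar
to the S. equation for the linear harmonic oscillator. Consequently, the
solutions are similar, with the only difference that for $q_1$

$$E_1 = \left(n_1 + \tfrac{1}{2}\right)h\nu_0,$$

and for $q_2$

$$E_2 = \left(n_2 + \tfrac{1}{2}\right)h\nu_0,$$

where $E = E_1 + E_2$ = total energy and $n_1$ and $n_2$ are not necessarily
identical.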

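In order to solve equation (26) we use the same transformation to
normal coordinates as indicated by equations (15) and (16). That is,
we set

$$z_1 = \frac{q_1 + q_2}{\sqrt{2}}, \qquad z_2 = \frac{q_1 - q_2}{\sqrt{2}}, \qquad (28)$$

and, as in the derivation of equation (17), we obtain the S. equation
in the form

$$\frac{\partial^2\phi}{\partial z_1^2} + \frac{\partial^2\phi}{\partial z_2^2} + a\left(E - \tfrac{1}{2}k_1z_1^2 - \tfrac{1}{2}k_2z_2^2\right)\phi = 0, \qquad (29)$$

where

$$k_1 = k - \frac{2e^2}{R^3}$$

and

$$k_2 = k + \frac{2e^2}{R^3}. \qquad (30)$$

Equation (29) is evidently separable into two ordinary differential
equations, of which the solutions are $\phi_{n_1}(z_1)$ and $\phi_{n_2}(z_2)$ with the
eigenvalues $E_{n_1}$ and $E_{n_2}$, and the two new frequencies are given by
equations (18).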
It follows from equations (30) that the frequency $\nu_1$, which is less
than $\nu_0$, corresponds to that of the symmetric mode, and $\nu_2$ (which is
greater than $\nu_0$) corresponds to the frequency of vibration for the
antisymmetric mode. Thus the behavior of two coupled oscillators
resembles that of the electron in two potential boxes or in the ionized
hydrogen molecule. Instead of an energy $E = h\nu_0$ for the lowest state
of the oscillators, we obtain the result $E = \tfrac{1}{2}h\nu_1 + \tfrac{1}{2}h\nu_2$.

Obviously, the transformation to principal coordinates is no longer
valid if $k$ is less than $2e^2/R^3$. That is, the conclusion involved in
(20) is tenable only if $R^3 > 2e^2/k$, which determines a lower limit
for the distance $R$ between the zero positions of the oscillators. Since,
as shown in equation (14), $e^2/k = \alpha$, the polarizability, we can also state
this limit in the form $R^3 > 2\alpha$. It should be observed that $\alpha$ may be
derived from measurements of dielectric constant or refractive index,
and is of the order of magnitude $10^{-24}$ cm.$^3$ Thus this theory of interaction
of linear oscillators is valid for values of $R > 10^{-8}$ cm. approximately,
which corresponds to intermolecular distances.

For the state of lowest energy $n_1 = n_2 = 0$, we thus obtain the
value for the coupling energy $\Delta E$ given by equation (20).

The negative sign shows that the energy arises from attractive forces,
and, since the energy varies as $R^{-6}$, the force of attraction is given by

$$F = -\frac{\partial(\Delta E)}{\partial R} = -\frac{3h\nu_0\alpha^2}{R^7}.$$

That is, the force of attraction between two pulsating dipoles varies inversely
as the seventh power of the distance. It, therefore, falls off rapidly
with increase in distance. Thus, if $R$ is doubled, the force decreases to
$(\tfrac{1}{2})^7 = 1/128$ of its original value.
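The way in which the $R^{-6}$ law emerges from the two shifted frequencies may be illustrated numerically. The sketch below is not part of the original argument: it assumes that the new frequencies are $\nu_1 = \nu_0\sqrt{1 - x}$ and $\nu_2 = \nu_0\sqrt{1 + x}$ with $x = 2\alpha/R^3$, takes $\alpha = 10^{-24}$ cm.$^3$ as a representative order of magnitude, and compares the exact change in zero-point energy with the leading term $-x^2/8$, that is, $-\alpha^2h\nu_0/(2R^6)$. The function names are chosen here for illustration.

```python
import math


def coupling_energy_exact(x):
    """Change in zero-point energy, in units of h*nu0, for x = 2*alpha/R^3.

    Assumes nu1 = nu0*sqrt(1 - x) and nu2 = nu0*sqrt(1 + x), so that
    Delta E = (1/2) h nu1 + (1/2) h nu2 - h nu0.
    """
    return 0.5 * math.sqrt(1.0 - x) + 0.5 * math.sqrt(1.0 + x) - 1.0


def coupling_energy_leading(x):
    """Leading term -x^2/8, i.e. -alpha^2 h nu0 / (2 R^6) in units of h*nu0."""
    return -x * x / 8.0


alpha = 1.0e-24                       # polarizability in cm^3 (assumed order of magnitude)
for R in (3.0e-8, 4.0e-8, 6.0e-8):    # separations in cm
    x = 2.0 * alpha / R ** 3
    print(R, coupling_energy_exact(x), coupling_energy_leading(x))

# Energy varies as R^-6, hence force as R^-7: doubling R divides the force by 2^7.
force_ratio = (1.0 / 2.0) ** 7
print(force_ratio)
```

At separations of a few Angstrom units the exact and the leading-term values agree closely, which is the sense in which equation (20) holds only for $R^3$ well above $2\alpha$.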

The behavior of the eigenfunction $\phi(z_1, z_2)$ throws additional light upon
the nature of this attractive force between linear oscillators. For the
lowest state ($n_1 = n_2 = 0$), it follows from equations (22) and (29) that

$$\phi(z_1, z_2) = \left(\frac{2c}{\pi}\right)^{1/2}(\nu_1\nu_2)^{1/4}\,e^{-c(\nu_1z_1^2 + \nu_2z_2^2)}, \qquad (32)$$



(33) 



where $\{\phi_0(q)\}^2$ is the product $\phi(q_1)\,\phi(q_2)$ for $R = \infty$ [solution of
equation (27)] and $V$ is the perturbing potential energy function,
defined by the relation



From equation (33) we deduce the probability distribution function 
for the coordinates of the two oscillators in the form 



(34) 



The exponent of $e$ in this equation is given by the expression
which shows that when $z_1$ and $z_2$ are of the same sign
(the pulsations are in phase), $\phi^2(z_1, z_2) > \{\phi(q_1)\,\phi(q_2)\}^2$, and when $z_1$ and $z_2$ are
opposite in phase, the reverse is true. As
Lennard-Jones points out:




This result can be represented diagrammatically in a two-dimensional space, as
shown in Fig. 49. The distribution function, being proportional to
$e^{-2c(\nu_1z_1^2 + \nu_2z_2^2)}$ [see equation (32)], is like a ridge with its maximum at the
origin, and with elliptical contours. 14 The major axes of the ellipses are along
$q_1 = +q_2$, and the minor axes along $q_1 = -q_2$. The corresponding distribution
without interaction is a round-topped mountain with circular contours. The
probability of finding $q_1$ and $q_2$ with the same sign has increased, while that of
finding them with opposite sign has decreased. In other words, the interacting
dipoles tend on the average to move in phase.

FIG. 49. Phase relations for two coupled linear harmonic oscillators.

8.5 Van der Waals Interaction Energy for Two Hydrogen Atoms. 
Let us now consider the interaction of two hydrogen atoms separated by 
a distance R which is large compared with the average distance of the 
electron in each atom from its associated nucleus. (The following 
argument is valid only for this case.) Disregarding the potential energy 
terms due to the force between electron and nucleus in each atom, the 

14 The equation of an ellipse with respect to rectangular coordinates through
the center is $x^2/a^2 + y^2/b^2 = 1$, where $a$ and $b$ are semi-major and -minor axes,
respectively. Hence the expression $2c(\nu_1z_1^2 + \nu_2z_2^2)$ is constant for all values of
$z_1$ and $z_2$ which occur on the ellipse whose semi-axes are $1/\sqrt{2c\nu_1}$ and $1/\sqrt{2c\nu_2}$.
Thus, if we have a series of confocal ellipses, as indicated in Fig. 49, we can describe
any simultaneous combination of values of $z_1$ and $z_2$ by stating the serial number of
the corresponding ellipse. The use of such coordinates is often very convenient
in mathematical problems and has proved useful in dealing with problems of potential.
potential energy arising from the Coulomb forces between charged
particles in one atom (nucleus A and electron 1) and similar particles
in the other (nucleus B and electron 2) is given (see
Fig. 50) by

$$V = e^2\left(\frac{1}{R} + \frac{1}{r_{12}} - \frac{1}{r_{A2}} - \frac{1}{r_{B1}}\right). \qquad (35)$$



Let $x_1y_1z_1$ and $x_2y_2z_2$ designate the coordinates
of each electron (1 and 2 respectively) with respect
to its nucleus (A and B respectively), and let the
line joining the nuclei lie along the z-axis. Then

$$r_{12}^2 = (x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 + R - z_2)^2,$$
$$r_{A2}^2 = x_2^2 + y_2^2 + (R - z_2)^2,$$
$$r_{B1}^2 = x_1^2 + y_1^2 + (R + z_1)^2.$$
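Hence,

$$r_{12}^2 = R^2\left[1 + \frac{2(z_1 - z_2)}{R} + \frac{(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2}{R^2}\right].$$

Also

$$\frac{1}{r_{12}} = \frac{1}{R}\left[1 + \frac{2(z_1 - z_2)}{R} + \frac{(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2}{R^2}\right]^{-1/2},$$

and similar expressions may be written for $1/r_{A2}$ and $1/r_{B1}$. Replacing
each of the expressions in the brackets as in (18), by means of the
expansion

$$(1 + x)^{-1/2} = 1 - \tfrac{1}{2}x + \tfrac{3}{8}x^2 - \cdots,$$

we obtain the relation

$$V = \frac{e^2}{R^3}\left(x_1x_2 + y_1y_2 - 2z_1z_2\right) + \text{terms involving } (1/R)^4 \text{ and higher powers of } 1/R.$$

FIG. 50. Illustrating van der Waals interaction of two hydrogen atoms.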
The expression involving $1/R^3$ corresponds to the dipole-dipole interaction
and is given by

$$V = \frac{e^2}{R^3}\left(x_1x_2 + y_1y_2 - 2z_1z_2\right). \qquad (36)$$

Equation (36) may be derived directly, 15 without going through
the preceding argument, if we consider the motion of the electron in
each atom as equivalent to that of a dipole, for which both the electric
moment and direction of orientation are varying continuously. From
this point of view we therefore regard each atom as a dipole for which
the respective components along the three coordinate axes are $ex_1$, $ey_1$,
$ez_1$, and $ex_2$, $ey_2$, $ez_2$. The result derived in equation (36) is thus an
extension of that derived in equation (9).
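The passage from the exact Coulomb expression to the dipole-dipole term may be verified numerically. The following Python sketch is an illustration under stated assumptions: the electron positions `p1` and `p2` are arbitrary test values, the charge is set to unity, and the distances are taken as $r_{12}^2 = (x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 + R - z_2)^2$, $r_{A2}^2 = x_2^2 + y_2^2 + (R - z_2)^2$ and $r_{B1}^2 = x_1^2 + y_1^2 + (R + z_1)^2$. The function names are hypothetical.

```python
import math


def coulomb_interaction(p1, p2, R, e=1.0):
    """Exact e^2 (1/R + 1/r12 - 1/rA2 - 1/rB1) for two hydrogen atoms.

    p1 = (x1, y1, z1) is electron 1 relative to nucleus A and
    p2 = (x2, y2, z2) is electron 2 relative to nucleus B; the nuclei lie
    on the z-axis a distance R apart.
    """
    x1, y1, z1 = p1
    x2, y2, z2 = p2
    r12 = math.sqrt((x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 + R - z2) ** 2)
    rA2 = math.sqrt(x2 ** 2 + y2 ** 2 + (R - z2) ** 2)
    rB1 = math.sqrt(x1 ** 2 + y1 ** 2 + (R + z1) ** 2)
    return e * e * (1.0 / R + 1.0 / r12 - 1.0 / rA2 - 1.0 / rB1)


def dipole_dipole(p1, p2, R, e=1.0):
    """Leading term (e^2/R^3)(x1 x2 + y1 y2 - 2 z1 z2)."""
    x1, y1, z1 = p1
    x2, y2, z2 = p2
    return e * e * (x1 * x2 + y1 * y2 - 2.0 * z1 * z2) / R ** 3


p1 = (0.3, -0.2, 0.5)    # arbitrary test position of electron 1
p2 = (-0.4, 0.1, 0.6)    # arbitrary test position of electron 2
ratios = {}
for R in (20.0, 50.0, 200.0):
    ratios[R] = coulomb_interaction(p1, p2, R) / dipole_dipole(p1, p2, R)
    print(R, ratios[R])   # the ratio approaches unity as R increases
```

The departure of the ratio from unity measures the neglected terms in $(1/R)^4$ and higher powers, and it diminishes as $R$ becomes large compared with the electron displacements.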

Lennard-Jones writes: 

The probability distribution function can now be represented only in six-dimensional
space. It appears that $z_1$ and $z_2$ tend to have the same sign (as before), while
$x_1$ tends to be opposite to $x_2$ and $y_1$ opposite to $y_2$. These are just the configurations
for which the oscillators attract. Hence, we may say that two systems tend to
interact in such a way that the attraction is a maximum. It is in the generalized sense
that we use the expression, "tend to move in phase," for actually in this example the
x's and y's tend to be out of phase in the usual sense.

Such a procedure fails, however, to take into account terms involving 
powers of (1/R) higher than (1/R) 3 . Furthermore, as will be shown 
in the following chapter, it is possible, by application of the second- 
order perturbation theory to the relation for V given in equation (35), 
to derive a more accurate value of the interaction energy of two 
atoms. 

It follows that the attraction due to each term $x_1x_2$, $y_1y_2$, and $z_1z_2$ can
be treated separately, and as shown by H. R. Hassé, 16 the interaction
energy due to each of the first two terms is $\tfrac{1}{4}$ of that due to the $z_1$ and $z_2$
oscillators, and therefore equal to $\alpha^2h\nu_0/(8R^6)$. Hence, the total
interaction energy for a pair of three-dimensional oscillators is

$$\Delta E = -\frac{3}{4}\,\frac{h\nu_0\alpha^2}{R^6}. \qquad (37a)$$

Let us now attempt to apply this result to the calculation of the 
interaction energy for two hydrogen atoms. 

15 See, for instance, J. H. Jeans' "The Mathematical Theory of Electricity and
Magnetism," Cambridge Univ. Press (1908), p. 368, equation (355).

16 Hassé, Proc. Cambridge Phil. Soc., 27, 66 (1931). See also F. London,
Z. physik. Chemie, B11, 222 (1931).
By investigating the behavior of a Bohr orbit in an electric field it has
been shown that the polarizability of the atom is given by $\alpha = (9/2)a_0^3$,
where $a_0$ is the radius of the orbit. 17

Hence the interaction energy of two hydrogen atoms is given by
the relation

$$\Delta E = -\frac{243}{16}\,\frac{h\nu_0a_0^6}{R^6}. \qquad (37b)$$



What value of $\nu_0$ can we assign to the electron in a hydrogen atom?
Obviously, this " frequency " is a fiction, since, on the basis of the 
Principle of Indeterminacy, no experiment can be devised to determine a 
frequency of revolution of the electron in its orbit. But when in doubt, 
it is customary to appeal to the Principle of Correspondence (see Chap- 
ter I). According to this principle the behavior of the electrons in an 
atomic system must approach more and more that predicted by classical 
physics the higher the quantum number of the orbit. This is equiva- 
lent to the statement that when dealing with larger-scale phenomena the 
results deduced by quantum mechanical methods must approximate 
those deduced by classical mechanics. 

In the case of the hydrogen atom, the electronic orbits increase in
radius as the ionizing stage is approached. If $V_i$ denotes the ionization
potential, then the limit of the line spectrum, that is, the maximum frequency
of light which can be emitted by the return of an electron to
the lowest level, is given by

$$\nu_0 = \frac{V_ie}{h}. \qquad (38)$$

According to electromagnetic theory, radiation of this frequency
would be emitted by an oscillator having the identical frequency $\nu_0$.
Therefore, we might use in equation (37a) the value of $\nu_0$ given by the
last equation.

Again, the resonance potential ($V_r$) of hydrogen corresponds to the
first excited state, and the frequency of radiation emitted because of a
transition from this level to the lowest or normal state is

$$\nu_r = \frac{V_re}{h}. \qquad (39)$$

Since $V_r = V_i(1 - \tfrac{1}{4}) = (\tfrac{3}{4})V_i$, the value of $\nu_0$ is thus confined to
the two limits defined by equations (38) and (39).

17 See P. Debye, " Polar Molecules," Chapter I. 
Now the energy of the hydrogen atom in the normal state is given
by

$$E_0 = -\frac{e^2}{2a_0}.$$ 18

Since $V_ie = -E_0$, the corresponding frequency is

$$\nu_0 = \frac{e^2}{2a_0h}. \qquad (40)$$

Substituting for $\nu_0$ in (37b), the interaction energy is found to be given
by a relation of the form

$$\Delta E = -C\,\frac{e^2a_0^5}{R^6}, \qquad (41)$$

where $C = 243/32 = 7.59$ if $\nu_0$ is calculated by means of equation (38),
and $C = \tfrac{3}{4} \times 7.59 = 5.69$ if equation (39) is used.
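The two limiting values of $C$ follow from simple arithmetic, which may be set out as below. This Python sketch only retraces the steps stated in the text, assuming $\Delta E = -(3/4)h\nu_0\alpha^2/R^6$, $\alpha = (9/2)a_0^3$ and $h\nu_0 = e^2/(2a_0)$; the variable names are chosen here for illustration.

```python
from fractions import Fraction

oscillator_factor = Fraction(3, 4)       # Delta E = -(3/4) h nu0 alpha^2 / R^6
polarizability_factor = Fraction(9, 2)   # alpha = (9/2) a0^3
h_nu0_factor = Fraction(1, 2)            # h nu0 = e^2 / (2 a0), ionization limit

# C multiplies e^2 a0^5 / R^6 in the interaction energy.
C_ionization = oscillator_factor * polarizability_factor ** 2 * h_nu0_factor

# Using the resonance potential instead lowers the frequency by the factor 3/4.
C_resonance = Fraction(3, 4) * C_ionization

print(C_ionization, float(C_ionization))
print(C_resonance, float(C_resonance))
```

The first value is exactly 243/32; the second is 729/128, which agrees with the figure 5.69 of the text to the two decimals carried there.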

The result thus obtained obviously represents only a first approxi- 
mation to the correct value. A calculation of the second-order per- 
turbation energy, in which the perturbing potential term is given by 
(36), yields a value 19 C = 6. For a more accurate calculation it is 
necessary to take into consideration dipole-quadrupole and quadrupole- 
quadrupole interaction, as has been done for helium by H. Margenau. 20 
The calculation can also be made by using the method of variation of 
parameters, which is discussed in a subsequent chapter. 

Such calculations have been carried out by R. Eisenschitz and
F. London, 21 H. R. Hassé, 22 J. E. Lennard-Jones, 23 J. C. Slater and
J. G. Kirkwood, 24 and L. Pauling and J. Y. Beach. 25 The values
derived for C by the different investigators are shown in the following
summary:

Hassé                    6.49
Slater and Kirkwood      6.49
Pauling and Beach        6.4984; 6.499

The value $C = 6.499$, obtained by Pauling and Beach, is probably
"extremely close to the correct value." 26 Inserting this value of C

18 $E_0 = -13.53$ v.e.

19 Pauling and Wilson, "Introduction to Quantum Mechanics," p. 385.

20 Phys. Rev., 38, 747 (1931).

21 Z. Physik, 60, 491 (1930).

22 Proc. Cambridge Phil. Soc., 27, 66 (1931).

23 Proc. Roy. Soc. (London), A129, 598 (1930).

24 Phys. Rev., 37, 682 (1931).

25 Phys. Rev., 47, 686 (1935).

26 Pauling and Wilson, "Introduction to Quantum Mechanics," p. 382.
and the values $e = 4.770 \times 10^{-10}$ e.s.u., $a_0 = 0.528 \times 10^{-8}$ cm., the
value of the interaction energy of two hydrogen atoms becomes

$$\Delta E = -6.064 \times 10^{-60}\,R^{-6} \text{ erg}. \qquad (42)$$
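The numerical coefficient in equation (42) may be recomputed from the constants just quoted. The sketch below assumes the form $\Delta E = -Ce^2a_0^5/R^6$ of equation (41) and uses the values of $C$, $e$ and $a_0$ given in the text; the helper `interaction_energy` is a name introduced here for illustration.

```python
C = 6.499          # Pauling and Beach value quoted in the text
e = 4.770e-10      # electronic charge in e.s.u., as given in the text
a0 = 0.528e-8      # radius a0 in cm, as given in the text

# Coefficient of R^-6 in erg cm^6, so that Delta E = -coefficient / R^6.
coefficient = C * e ** 2 * a0 ** 5
print(coefficient)


def interaction_energy(R_cm):
    """Interaction energy in erg of two hydrogen atoms a distance R_cm (cm) apart."""
    return -coefficient / R_cm ** 6


print(interaction_energy(5.0e-8))   # example: separation of 5 Angstrom units
```

The coefficient so obtained is close to $6.07 \times 10^{-60}$, agreeing with the value in equation (42) to within about 0.1 per cent; the small difference presumably reflects rounding of the constants.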

8.6 Van der Waals Energy for Other Atoms and Derivation of
Constant "a." For the case of two helium atoms, F. London's 27
method of calculating the energy of interaction leads to the relation

$$\Delta E = -\frac{3}{4}\,\frac{\alpha^2V_ie}{R^6},$$

where $\alpha$ is the polarizability and $V_i$ the ionization potential. Substituting
the value $\alpha = 0.205 \times 10^{-24}$ which is that calculated from the
refractive index, and the value $V_i = 24.5$ volts, this corresponds to



where $E_0 = -\dfrac{e^2}{2a_0}$.

By application of the perturbation theory, Slater and Kirkwood 28 
deduced for this interaction energy the value 

-3.1800 



For more complex atoms and molecules Slater and Kirkwood also 
deduced for the energy of interaction a relation of the form 



where $\nu$ = number of electrons in valence shell.

While this equation, as well as (42) and (43), expresses the energy in
terms of ergs per molecule, it is of interest to calculate the magnitude of
this energy per gram-molecule. In terms of kg.cal./mole,

$$\Delta E = -163 \times \frac{\nu^{1/2}\alpha^{3/2}}{R^6}, \qquad (44b)$$

with $\alpha$ expressed in units of $10^{-24}$ cm.$^3$ and $R$ in Angstrom units.

Thus, in the case of CH$_4$, $\nu = 8$, $\alpha = 2.59 \times 10^{-24}$, and van der
Waals' constant $b = 2\pi NR^3/3 = 55.9$ cm.$^3$/mole. Hence the interaction energy amounts to
991.2 cal./mole, that is, approximately 1000 cal./mole. Since this
value is of the same order of magnitude as the heat of evaporation of

27 Z. physik. Chemie, B11, 222 (1931).
28 Phys. Rev., 37, 682 (1931).




solid or liquid methane, the result obtained indicates that the attractive 
forces which give rise to cohesion in non-polar liquids and solids are prob- 
ably of the same nature as those due to the interaction of fluctuating 
dipoles. As will be shown in a subsequent section, a more rigorous 
calculation confirms this conclusion. 
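The figure of about 1000 cal./mole for methane may be retraced as follows. This Python sketch rests on assumptions that are not stated explicitly in the text as it stands: the coefficient 163 of equation (44b) is taken to multiply $\nu^{1/2}\alpha^{3/2}/R^6$ with $\alpha$ in units of $10^{-24}$ cm.$^3$ and $R$ in Angstrom units (an interpretation adopted here because it reproduces the quoted result), and Avogadro's number is taken as $6.06 \times 10^{23}$.

```python
import math

N = 6.06e23     # Avogadro's number (assumed value)
nu = 8          # number of valence electrons in CH4
alpha = 2.59    # polarizability in units of 10^-24 cm^3
b = 55.9        # van der Waals constant b in cm^3/mole

# Distance R from b = 2 pi N R^3 / 3.
R_cm = (3.0 * b / (2.0 * math.pi * N)) ** (1.0 / 3.0)
R_angstrom = R_cm * 1.0e8

# Magnitude of the interaction energy in kg.cal./mole (assumed unit convention).
energy_kcal = 163.0 * math.sqrt(nu) * alpha ** 1.5 / R_angstrom ** 6
energy_cal = 1000.0 * energy_kcal
print(R_angstrom, energy_cal)
```

Under these assumptions $R$ comes out near 3.5 Angstrom units and the energy within about one per cent of the 991.2 cal./mole quoted above.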
Referring now to equation (iv) in section 1, it is evident that 



(w 



m 1 



Since $m - 1 = 6$, as deduced on the basis of London's theory, we
obtain the relation

$$A = -6\,\Delta E \cdot R^6.$$

Thus, for the case of hydrogen atoms, it follows from equation (42)
that $A_H = 36.386 \times 10^{-60}$ dyne cm.$^6$, while for the case of helium
atoms, it follows from equation (43) that

$$A_{He} = 8.91 \times 10^{-60} \text{ dyne cm.}^6$$

London's theory would lead to the lower value

$$A_{He} = 7.44 \times 10^{-60} \text{ dyne cm.}^6$$

For other atoms and molecules values of A may be derived by means 
of equation (44a). 

In order to calculate a it is necessary, as is evident from equation 
(vii), to determine the repulsive energy term involving B in equation 
(iv). J. E. Lennard-Jones has shown that the magnitude of this term 
may be derived empirically from PV versus T data for the gases, on the 
basis $m = 7$. The values of $n$ thus derived vary from 10 to 13, indicating
that the repulsive force increases extremely rapidly with decrease
in $r$ below the equilibrium value $r_0$.

J. C. Slater and J. G. Kirkwood 29 have shown that, in the case of
helium, the repulsive potential energy term may be written in the
form

$$U(\text{repulsive}) = Be^{-cr/a_0},$$

where $B$ and $c$ are constants. A similar expression has been deduced by
M. Born and J. E. Mayer 30 for the repulsive energy between two ions
in a halide lattice. Actually, it makes little difference whether the exponential
form is used or an expression of the form $r^{-n}$, if $n$ is as large

29 Phys. Rev., 37, 682 (1931).
30 Z. Physik, 75, 1 (1932).
as 10. Figure 51 31 gives the potential energy of pairs of inert gas atoms
as a function of their distance apart in Angstrom units ($10^{-8}$ cm.), and it
will be observed that, owing to the extremely rapid increase in the
repulsive force with decrease in distance beyond the equilibrium value
(the value of $r$ at the minimum), $U(r)$ increases rapidly in this range.
The force at any value of $r$ is, of course, determined by the slope at
that point.

FIG. 51. Van der Waals potential energy curves for several gases.

For a first approximation we can assume that the value of $n$, the repulsive
force exponent, is so large that the repulsive energy term in equation
(iv) may be neglected. Hence, we can write equation (vii) in the form

$$a = -\frac{2\pi N^2}{1.013 \times 10^6}\int_{r_0}^{\infty}\Delta E \cdot r^2\,dr, \qquad (45)$$

where $\Delta E$ is the attractive energy term calculated by means of equations (42),
(43), or (44). Expressing the latter in the form

$$\Delta E = -\frac{\Lambda}{r^6},$$

we obtain the relation

$$a = \frac{2\pi N^2\Lambda}{1.013 \times 10^6}\int_{r_0}^{\infty}\frac{dr}{r^4} = \frac{2\pi N^2\Lambda}{3.039 \times 10^6\,r_0^3}.$$

Since van der Waals' constant $b$ is given in terms of $r_0^3$ by the relation

$$r_0^3 = \frac{3b}{2\pi N},$$

we can write (45) in the form

$$a = \frac{4\pi^2N^3\Lambda}{3 \times 3.039 \times 10^6\,b} = \frac{9.654 \times 10^{65}\,\Lambda}{b}. \qquad (46)$$

Hence, in terms of equation (44)

$$a = \frac{1.08 \times 10^{43}\,\nu^{1/2}\alpha^{3/2}}{b}, \qquad (47)$$

31 J. E. Lennard-Jones, Proc. Phys. Soc. (London), 43, 461 (1931).
and in terms of London's relation:

1.13 X 10 5 VF 



(48) 



where V is the ionization or resonance potential (in volts). 

Both these relations have been shown to give values of a which are 
in reasonably satisfactory agreement with the values derived from the 
critical constants by means of (viii). 

8.7 Energies of Evaporation and Sublimation. The fact that the 
quantum mechanics theory leads to satisfactory agreement between 
calculated and observed values of van der Waals' constant a indicates 
that the attractive forces which give rise to cohesion in non-polar liquids 
and solids must be of the same nature as those described in the previous 
section. 

As mentioned previously, the interaction energy per mole as cal- 
culated on the basis of London's theory is of the same order of mag- 
nitude as the energy of evaporation for solid CH 4 . London and 
Lennard-Jones have shown that it is possible by a more exact applica- 
tion of the results deduced in the previous sections to calculate the ener- 
gies of sublimation of crystal lattices of such molecules as those of the 
rare gases, methane, hydrogen, oxygen, nitrogen, and other gases. 
The present chapter would be incomplete without some discussion of 
this application of the quantum theory of van der Waals forces. 

Given U(r) for a pair of molecules it is possible to calculate the total 
potential energy of all the molecules in a gram-molecular volume of the 
crystal lattice. The method used may be described as a summation 
of the energy for every pair of molecules in the lattice. If we denote the 
total energy by fa, then 



where $C_A$ is the so-called crystal potential constant 32 for attraction, which
depends upon both the value of the attractive force exponent $m$ and the
type of crystal lattice, and $U_A$ is the attractive energy term. $U_R$ designates
the repulsive energy term and $C_R$ is the corresponding crystal
potential constant (which depends upon the value of $n$). The factor
($\tfrac{1}{2}$) is necessary to avoid counting the same atom twice.

The application of the last equation to calculate heats of evaporation 
may be illustrated by the results obtained for argon. F. London, in 

32 Values of these constants for different values of m and n and for various types of
lattice structure are given in Fowler's "Statistical Mechanics," Chapter X, also by
J. E. Jones and A. E. Ingham, Proc. Roy. Soc. (London), A107, 636 (1925).
his calculation, assumes that the repulsive force exponent $n = \infty$, so
that $U_R = 0$. For the attractive energy term $U_A$ he uses the relation

$$U_A = -\frac{3}{4}\,\frac{h\nu_0\alpha^2}{r^6},$$

where $h\nu_0 = V_i \cdot e$, with $V_i = 15.5$ electron volts, and $\alpha = 1.63 \times 10^{-24}$,
while $r$ is the minimum distance between atoms.

The observed crystal spacing (minimum distance between atoms) is
$3.84 \times 10^{-8}$ cm., while $C_A = 115.4$. Hence, the energy of sublimation
in calories is

fa = N 4.92 X 1(T 59 X 115.4 



4.184 X 10 7 2 4.184 X 10 7 X (3.84 X 1(T 8 ) 6 

= 1715cal./mole. 

The observed value, according to London, is 2030 cal./mole. 

J. E. Lennard-Jones has carried out similar calculations in which the 
repulsive energy term has been taken into account. It should be 
mentioned, however, that, in general, the value of the term $C_RU_R$ is
small (less than 20 per cent compared with that of $C_AU_A$). In all cases,
the agreement between values of S calculated and those observed is 
satisfactory. Even for many organic compounds in the liquid state, the 
heats of evaporation may be calculated, with a fair degree of approxima- 
tion, as the writer has observed. Thus in these cases the forces of 
cohesion in the solid and liquid state are of the type discussed in the 
present chapter. 

On the other hand, in some cases a large fraction of the energy of attraction
must be due to the presence of intrinsic dipole moments, as for
example in the case of H$_2$O, C$_6$H$_5$NO$_2$, and other molecules for which
the electric moment $\mu_e$ is considerable.

In the case of ionic lattices, such as that of NaCl, the attractive forces 
are electrostatic, and the corresponding potential energy is given by 

N e 2 

< (attractive) = -r-C^- , 
2 r 

where r is the minimum distance between ions. The problem of co- 
hesion in such lattices has been the subject of a large number of in- 
vestigations, and the reader who is interested in following it further 
will find his efforts well repaid. 

It should be added that the polarizability $\alpha$, as well as the proper
values of $\nu_0$, necessary for the determination of $U_A$, may be derived on
the basis of wave mechanics from spectral data. However, since such
calculations involve the application of the perturbation theory, a discussion
of the methods used as well as of other features of the more
recent developments of the theory must be postponed for the following
chapter. 33
CHAPTER IX
PERTURBATION THEORY 

9.1 Introductory Remarks. The problem of the atom with more 
than one electron is one which, on the basis of the Bohr theory, has its 
analog in celestial mechanics, where the problem is that of calculating, 
for instance, the effect of the gravitational pull of the sun on the motion 
of the moon around the earth. The field due to the action of the sun is 
said " to perturb " the motion of the moon, and obviously the effect 
of the sun must vary with its position in relation to both the moon and 
earth. 

It is the existence of similar perturbations in atomic systems, due to 
the interaction of the electrons, that makes the consideration of such 
systems a more complex mathematical problem than that of the hydro- 
gen-like atom. Thus, in the case of the helium atom, the simplest type of 
many-electron atoms, the total energy is evidently made up of three 
terms: 

(1) the kinetic energy of each electron, 

(2) the potential energy due to the attractive force between the nucleus 
and each electron, 

(3) the potential energy due to the repulsive force between electrons. 

If we designate the charge on the helium-like nucleus by $Ze$, the distance
of each electron from the nucleus by $r_1$ and $r_2$, respectively, and
the interelectronic distance by $r_{12}$, the total potential energy term is
given by

$$V = -\frac{Ze^2}{r_1} - \frac{Ze^2}{r_2} + \frac{e^2}{r_{12}}.$$

The corresponding S. equation is given by

$$\nabla_1^2\phi + \nabla_2^2\phi + \frac{8\pi^2\mu}{h^2}\left(E + \frac{Ze^2}{r_1} + \frac{Ze^2}{r_2} - \frac{e^2}{r_{12}}\right)\phi = 0, \qquad (1)$$

where the subscripts 1 and 2 of the Laplacian operators (in polar coordinates)
refer to each electron. If the term $e^2/r_{12}$ were omitted, the
solution of the S. equation would be identical with that for two separate
hydrogen-like atoms of nuclear charge $Ze$. That is, the solution would be

$$\phi = \phi_{n_1}(r_1, \theta_1, \eta_1)\,\phi_{n_2}(r_2, \theta_2, \eta_2), \qquad (2)$$

where $r$, $\theta$, and $\eta$ are the coordinate variables of each electron with
respect to the nucleus, and $\phi$ is a characteristic function of the form
dealt with previously in discussing the hydrogen-like system. Furthermore,
the characteristic energy values would be given by

$$E = E_{n_1} + E_{n_2},$$

where $E_{n_1}$ and $E_{n_2}$ represent the eigenvalues corresponding to the
functions $\phi_{n_1}$ and $\phi_{n_2}$, respectively.

It is the presence of the "perturbation" potential energy term
$e^2/r_{12}$ which makes the solution of the S. equation (1) more difficult.
A similar perturbation term occurred in the S. equation (8.26) for the
interaction of two linear harmonic oscillators, and there the mathematical
difficulty was overcome by a transformation of coordinates.
By this process it was found possible to separate the variables and thus
obtain a product solution of the form indicated in (2). However, it has
not been found possible to devise a transformation scheme for equation
(1), by which the variables may be separated.

Consequently, it is necessary to use other methods. Of these the 
most important are the methods based (1) on the application of the per- 
turbation theory and (2) on the application of the calculus of variations. 
It should be mentioned that the first of these represents the quantum 
mechanics modification of the method which has been used in calculations 
of orbits in celestial mechanics, while the second method involves a 
fundamental principle which has also been applied in classical physics. 

The perturbation theory has been described by E. C. Kemble, as 
" Mostly machinery a necessary evil!" The derivation of the first- 
and second-order perturbation terms given in the following sections is 
practically the same as that given by Condon and Morse. 1 It has the 
advantage over the derivation given by Schroedinger and other writers 
that it requires no knowledge of Green's theorem. 

In the following sections an attempt has been made to present each 
of the steps in the derivation as clearly as possible, and for this reason 
the argument will appear somewhat tedious at best. However, the only 
excuse that can be given for the whole discussion is that a knowledge 
of the perturbation theory and of the accompanying concept of matrix 
elements is absolutely essential for the proper understanding of the man- 
ner in which quantum mechanics has solved a vast number of problems 
presented by atomic and molecular systems. 

1 Condon and Morse, " Quantum Mechanics," Chapter IV. 
9.2 Perturbation Theory: First-Order Terms. Let us consider
first the non-degenerate case, that is, one in which there is only one
eigenfunction corresponding to each discrete energy state. The S.
equation for a given system will be of the form

$$\nabla^2u + \alpha^2(E - V)u = 0, \qquad (3)$$

where $\alpha^2 = 8\pi^2\mu/h^2$, and $\nabla^2u$ may be of the form $\nabla_1^2u + \nabla_2^2u + \ldots$,
if we are considering an atomic system containing two or more electrons.
The potential energy $V$ is of the form

$$V = V_0 + \lambda V_1, \qquad (4)$$

where $V_0$ represents the potential energy in the absence of any perturbation,
while $\lambda V_1$ is the perturbation term. We regard the latter
as the product of a term $V_1$ which depends upon the coordinates and
an arbitrary parameter $\lambda$, the value of which is small. Thus in the
case of the helium-like atom, we could write

$$\lambda V_1 = \frac{e^2}{r_{12}} = \frac{1}{Z}\left(\frac{Ze^2}{r_{12}}\right)$$

and regard $1/Z$ as such a variable parameter whose value is small for
large values of $Z$.
The S. equation for the unperturbed system is of the form

$$\nabla^2u + \alpha^2(E - V_0)u = 0, \qquad (5)$$

and we shall assume that solutions of this equation are known. We
shall designate these by $u_n^0$ and the corresponding eigenvalues by $E_n^0$,
so that they satisfy the S. equation

$$\nabla^2u_n^0 + \alpha^2(E_n^0 - V_0)u_n^0 = 0. \qquad (6)$$

Furthermore, we shall assume that these functions $u_n^0$ are orthonormal.
That is,

$$\int \bar{u}_n^0u_m^0\,d\tau = \delta_{nm} = \begin{cases} 1 & \text{for } n = m, \\ 0 & \text{for } n \neq m, \end{cases} \qquad (7)$$

where $d\tau$ indicates the element of "volume" (which in the case of an
atomic system will be of dimension $3\nu$, if $\nu$ represents the total number of
electrons), and the integration is carried out over the whole domain in
which the coordinate variables have a physical significance. This
domain is known as the configuration space and will be designated by
this term in the subsequent discussion. The symbol $\delta_{nm}$ is used generally
in quantum mechanics in order to indicate that the expression may
have the value 1 or 0 according as $n$ is equal, or not, to $m$, and we shall,
henceforth, use it in the sense defined in equation (7).
Let us now assume that the functions and energy values which are
obtained by solving the S. equation (3), for the perturbed system, may
be represented in terms of the functions $u_n^0$ and $E_n^0$ by expressions of
the form

$$u_n = u_n^0 + \lambda\phi_n + \lambda^2\chi_n, \qquad (8)$$

$$E_n = E_n^0 + \lambda\epsilon_n + \lambda^2\eta_n, \qquad (9)$$

where $\phi_n$ and $\chi_n$ are functions of the coordinate variables, which represent
the first-order and second-order perturbations, respectively, while
$\epsilon_n$ and $\eta_n$ represent the corresponding perturbation energy terms of the
first and second order, respectively. The parameter $\lambda$ is identical with
that used in equation (4).

If, now, we replace $V$, $u$, and $E$ in equation (3) by the expressions in
equations (4), (8), and (9), respectively, and carry out the multiplication
indicated, the resultant equation is of the form

$$\nabla^2u_n^0 + \alpha^2(E_n^0 - V_0)u_n^0 + \lambda\left[\nabla^2\phi_n + \alpha^2(E_n^0 - V_0)\phi_n + \alpha^2(\epsilon_n - V_1)u_n^0\right]$$
$$+\ \lambda^2\left[\nabla^2\chi_n + \alpha^2(E_n^0 - V_0)\chi_n + \alpha^2(\epsilon_n - V_1)\phi_n + \alpha^2\eta_nu_n^0\right] + \cdots = 0. \qquad (10)$$

Since $\lambda$ is an arbitrary parameter, it follows that the coefficient of
each power of $\lambda$ in equation (10) must vanish identically. The coefficient
of $\lambda^0 = 1$ is evidently the same as the left-hand side of equation
(6) and merely states the obvious conclusion that, for $\lambda = 0$, the solutions
of equations (5) and (3) are identical.

The coefficient of $\lambda$ in equation (10) yields the inhomogeneous differential
equation

$$\nabla^2\phi_n + \alpha^2(E_n^0 - V_0)\phi_n = \alpha^2(V_1 - \epsilon_n)u_n^0. \qquad (11)$$

If the right-hand side of this equation were equal to zero, the equation
would be of the homogeneous type and identical in form with equation
(6). The presence of the expression on the right-hand side of
(11) necessitates a special procedure in order to solve the equation.

Since $\phi_n$ is assumed to be a function of the coordinate variables
which is finite and continuous over the whole domain, it may be expressed
as a "Fourier's series" in terms of the orthonormalized functions
$u_k^0$. That is, it is possible to develop the function $\phi_n$ in the form

$$\phi_n = \sum_k B_{nk}u_k^0, \qquad (12)$$
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where the summation is extended over all integral values of k from 
to oo , and any one of the coefficients B nk can be derived by the relation 

B nk 

as indicated in the supplementary note, Chapter VI. However, we 
shall leave the coefficients undetermined for the present. The subscript
$nk$ indicates that the coefficient is that of the function $u_k^0$ and
furthermore that it refers to the series development for the function $\phi_n$.
Thus, for each characteristic solution $u_n^0$ of the S. equation, there will
exist a function $\phi_n$ which may be represented by an infinite series of terms,
each of which is the product of a constant (that is, a magnitude such as
$B_{nk}$ which is independent of the coordinate variables) and one of the
series of orthonormal functions $u_k^0$.
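Substituting from (12) into (11), the result is

$$\sum_k B_{nk} \left[ \nabla^2 u_k^0 + \alpha^2 (E_n^0 - V_0)\, u_k^0 \right] = \alpha^2 (V_1 - \epsilon_n)\, u_n^0. \tag{13}$$

But, according to equation (6),

$$\nabla^2 u_k^0 = -\alpha^2 (E_k^0 - V_0)\, u_k^0.$$

Hence equation (13) may be written in the form

$$\sum_k B_{nk} (E_n^0 - E_k^0)\, u_k^0 = (V_1 - \epsilon_n)\, u_n^0. \tag{14}$$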

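Multiplying both sides of (14) by $\bar{u}_n^0$ and integrating over the
configuration space, the result is

$$\sum_k B_{nk} (E_n^0 - E_k^0) \int \bar{u}_n^0 u_k^0\, d\tau = \int \bar{u}_n^0 V_1 u_n^0\, d\tau - \epsilon_n \int \bar{u}_n^0 u_n^0\, d\tau. \tag{15}$$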

Since $\epsilon_n$ is a constant (the perturbation energy of the first order), it
may be written outside the sign of integration, whereas $V_1$ must be
kept inside the integration sign because it is a function of the coordinate
variables.

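Now for $k = n$, $E_n^0 - E_k^0 = 0$, while for $k \neq n$, $\int \bar{u}_n^0 u_k^0\, d\tau = 0$.
On the other hand, since $u_n^0$ is a normalized function,

$$\int \bar{u}_n^0 u_n^0\, d\tau = 1.$$

Hence, we deduce from (15) the very important result

$$\epsilon_n = \int \bar{u}_n^0 V_1 u_n^0\, d\tau. \tag{16a}$$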




" Thus," as Condon and Morse remark, 2 " the linear correction to 
any energy level is simply the average value of the linear term in the
potential energy function weighted according to the value of the squared
characteristic function at each place. This theorem is the quantum
mechanical analog of a similar result in classical mechanics: The alteration
of the energy of a quantum state in the first approximation is equal to
the average of the perturbation potential taken over the undisturbed orbit."

If $u_n^0$ is a real function, equation (16a) becomes

$$\epsilon_n = \int V_1\, (u_n^0)^2\, d\tau. \tag{16b}$$

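We may now proceed to determine the coefficients $B_{nk}$ in the expansion
for $\phi_n$ as given by equation (12). If in equation (14) we multiply
through by $\bar{u}_j^0$ and integrate over the configuration space, we obtain the
relation

$$\sum_k B_{nk} (E_n^0 - E_k^0) \int \bar{u}_j^0 u_k^0\, d\tau = \int \bar{u}_j^0 V_1 u_n^0\, d\tau - \epsilon_n \int \bar{u}_j^0 u_n^0\, d\tau.$$

Because of the orthogonality relation (7), the only non-vanishing
term on the left-hand side is that for which $k = j$, while the coefficient
of $\epsilon_n$ will vanish for $j \neq n$. Hence,

$$B_{nj} = \frac{\int \bar{u}_j^0 V_1 u_n^0\, d\tau}{E_n^0 - E_j^0}, \tag{17}$$

where $j \neq n$.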

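To determine the value of $B_{nn}$, we make use of the requirement that
$u_n$, the perturbed eigenfunction, must also be a normalized function.
Hence,

$$
\begin{aligned}
\int \bar{u}_n u_n\, d\tau = 1 = {} & \int \bar{u}_n^0 u_n^0\, d\tau + \lambda \int \bar{u}_n^0 \phi_n\, d\tau + \lambda \int u_n^0 \bar{\phi}_n\, d\tau \\
& + \lambda^2 \int \bar{\phi}_n \phi_n\, d\tau + \lambda^2 \int \bar{u}_n^0 \chi_n\, d\tau + \lambda^2 \int u_n^0 \bar{\chi}_n\, d\tau + \cdots.
\end{aligned}
$$

Because of the validity of equation (7) it follows that

$$0 = \lambda \left( \int \bar{u}_n^0 \phi_n\, d\tau + \int u_n^0 \bar{\phi}_n\, d\tau \right) + \text{terms in higher powers of } \lambda.$$

Since $\lambda$ is a variable parameter, the coefficient of each power of $\lambda$ must
vanish identically, and using the series for $\phi_n$ it follows that

$$\sum_k B_{nk} \int \left( \bar{u}_n^0 u_k^0 + u_n^0 \bar{u}_k^0 \right) d\tau = 0.$$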
In this equation, however, each term, except that for which $k = n$,
vanishes because of the orthogonality relation (since $B_{nk}$ may be written
outside the sign of integration for each term). Consequently,

$$2 B_{nn} \int \bar{u}_n^0 u_n^0\, d\tau = 0,$$

and hence, $B_{nn} = 0$.
That is, the series for $\phi_n$ is of the form

$$\phi_n = {\sum_k}' \frac{V_{kn}}{E_n^0 - E_k^0}\, u_k^0, \tag{18}$$

where the prime over the summation sign indicates that the value
$k = n$ is excluded in the summation, and

$$V_{kn} = \int \bar{u}_k^0 V_1 u_n^0\, d\tau. \tag{19}$$

2 Condon and Morse, "Quantum Mechanics," p. 119.

The quantity $V_{kn}$ has been designated, for reasons which will be
discussed in a subsequent section, as a "matrix element," and in the
following remarks this designation, as well as the symbol $V_{kn}$, will be
used to represent integrals of the type shown in equation (19).

In this connection it is also necessary to point out that, since $V_1$ is a
function of the coordinates, the expression $\bar{u}_k^0 V_1 u_n^0$ is identical with
$u_n^0 V_1 \bar{u}_k^0$. However, in view of the fact that in many quantum mechanical
problems the matrix elements contain operators which play the same
role as $V_1$ (but which are non-commutative), the order of the factors in
any one element becomes of prime importance. For this reason it has
been considered advisable, in the consideration of matrix elements such
as $V_{kn}$ or $\epsilon_n$, to write the expressions for the integrands in the form
used in (16a) and (19) rather than in that used in (16b), even though
$V_1$ may be commuted with the functions $u_n^0$, $\bar{u}_n^0$, or $\bar{u}_k^0$.
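
The first-order expansion of the eigenfunction can be tried out on a small matrix model. The following is only a sketch under assumptions that are not in the text: the unperturbed system is replaced by three hypothetical non-degenerate levels `E0`, the perturbation by a real symmetric matrix `V` of matrix elements, and `first_order_state` is a name chosen here for illustration. The perturbed state is built as the unperturbed state plus $\lambda$ times the series with coefficients $V_{kn}/(E_n^0 - E_k^0)$, $k \neq n$, and is compared with the exact eigenvector of the finite matrix.

```python
import numpy as np

def first_order_state(E0, V, n, lam):
    """Unperturbed state n plus the first-order correction.

    The coefficient of state k (k != n) is lam * V[k, n] / (E0[n] - E0[k]);
    the coefficient of state n itself receives no first-order correction.
    """
    E0 = np.asarray(E0, dtype=float)
    V = np.asarray(V, dtype=float)
    u = np.zeros(len(E0))
    u[n] = 1.0
    for k in range(len(E0)):
        if k != n:
            u[k] = lam * V[k, n] / (E0[n] - E0[k])
    return u

# Hypothetical model data, chosen only for illustration.
E0 = np.array([1.0, 2.0, 4.0])
V = np.array([[0.3, 0.4, 0.2],
              [0.4, 0.1, 0.5],
              [0.2, 0.5, -0.2]])
lam, n = 0.01, 0

approx = first_order_state(E0, V, n, lam)
_, vecs = np.linalg.eigh(np.diag(E0) + lam * V)
exact = vecs[:, n] / vecs[n, n]   # rescale so that the n-th component equals 1
print(np.max(np.abs(approx - exact)))
```

The remaining discrepancy is of the order of $\lambda^2$, which is the size of the second-order term not included in the sketch.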

9.3 Second-Order Perturbation Terms. To calculate the second-order
perturbation terms $\eta_n$ and $\chi_n$, it is necessary to consider the
inhomogeneous differential equation obtained from equation (10) by
equating the coefficient of $\lambda^2$ to zero. The resulting equation has the
form

$$\nabla^2 \chi_n + \alpha^2 (E_n^0 - V_0)\,\chi_n = -\alpha^2 \eta_n u_n^0 - \alpha^2 (\epsilon_n - V_1)\,\phi_n. \tag{20}$$



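As in the calculation of the first-order perturbation terms, we assume
that $\chi_n$ may be represented by a Fourier's series expansion in terms of
the orthonormal functions $u_k^0$. That is, we assume

$$\chi_n = \sum_k C_{nk}\, u_k^0, \tag{21}$$

where the coefficients $C_{nk}$ of each term will have to be determined by a
procedure analogous to that used for the determination of the coefficients
$B_{nk}$ in the series for $\phi_n$.
Substituting (21) in (20), the result is

$$\sum_k C_{nk} \left[ \nabla^2 u_k^0 + \alpha^2 (E_n^0 - V_0)\, u_k^0 \right] = -\alpha^2 \eta_n u_n^0 - \alpha^2 (\epsilon_n - V_1)\,\phi_n.$$

But

$$\nabla^2 u_k^0 = -\alpha^2 (E_k^0 - V_0)\, u_k^0.$$

Hence,

$$\sum_k C_{nk} (E_n^0 - E_k^0)\, u_k^0 = -\eta_n u_n^0 - (\epsilon_n - V_1)\,\phi_n. \tag{22}$$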



Multiplying both sides of this equation by $\bar{u}_n^0$ and integrating, it
follows from considerations similar to those used in deriving the value
of $\epsilon_n$ that

$$\eta_n = {\sum_k}' \frac{V_{nk} V_{kn}}{E_n^0 - E_k^0}, \tag{23}$$

where $V_{nk}$ is a matrix element defined by equation (19).
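
The energy formulas of the first and second order can be checked numerically on a finite matrix model. The following is a sketch under assumptions not made in the text: four hypothetical non-degenerate levels `E0`, a small real symmetric matrix `V` standing in for the matrix elements, and the function name `perturbation_energies`, which is ours. The first-order term is taken as the diagonal element $V_{nn}$ and the second-order term as the primed sum of $V_{nk}V_{kn}/(E_n^0 - E_k^0)$.

```python
import numpy as np

def perturbation_energies(E0, V, lam):
    """Return E0 + lam * eps + lam**2 * eta for non-degenerate levels.

    eps_n = V_nn
    eta_n = sum over k != n of V_nk * V_kn / (E0_n - E0_k)
    """
    E0 = np.asarray(E0, dtype=float)
    V = np.asarray(V, dtype=float)
    eps = np.diag(V).copy()
    # gap[n, k] = E0_n - E0_k; the diagonal is set to inf so the k = n term drops out.
    gap = E0[:, None] - E0[None, :]
    np.fill_diagonal(gap, np.inf)
    eta = np.sum(V * V.T / gap, axis=1)
    return E0 + lam * eps + lam**2 * eta

# Hypothetical model data, chosen only for illustration.
E0 = np.array([1.0, 2.0, 3.5, 5.0])
V = np.array([[0.2, 0.5, 0.1, 0.0],
              [0.5, -0.1, 0.3, 0.2],
              [0.1, 0.3, 0.4, 0.6],
              [0.0, 0.2, 0.6, -0.3]])
lam = 0.01

approx = perturbation_energies(E0, V, lam)
exact = np.linalg.eigvalsh(np.diag(E0) + lam * V)
print(np.max(np.abs(approx - exact)))
```

For a perturbation this small the two sets of values should differ only by terms of the third order in $\lambda$.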

The method used for the determination of the second-order perturbation
function $\chi_n$ is similar to that involved in the derivation of the
first-order term $\phi_n$.

From equation (22), by using the series for $\phi_n$, it follows that

$$\sum_k C_{nk} (E_n^0 - E_k^0)\, u_k^0 = -\eta_n u_n^0 - (\epsilon_n - V_1) \sum_j B_{nj}\, u_j^0. \tag{24}$$

(The subscript $j$ is used to differentiate the members of the series from
the series on the left-hand side.)

Multiplying both sides of equation (24) by $\bar{u}_i^0$ (where $i$ is one of the
series of values $1, 2, \ldots j \ldots k \ldots$), and integrating over the
configuration space, we obtain the relation

$$C_{ni} (E_n^0 - E_i^0) = \sum_j B_{nj} \int \bar{u}_i^0 V_1 u_j^0\, d\tau - \epsilon_n \sum_j B_{nj} \int \bar{u}_i^0 u_j^0\, d\tau, \tag{25}$$

since the terms on the left-hand side of (24) which do not involve $E_i^0$
vanish, while on the right-hand side the term involving $\eta_n$ vanishes
since $i \neq n$.
Now in (25)

$$\int \bar{u}_i^0 V_1 u_j^0\, d\tau = V_{ij};$$

also $\int \bar{u}_i^0 u_j^0\, d\tau = 1$ for $j = i$,
while $\int \bar{u}_i^0 u_j^0\, d\tau = 0$ for $j \neq i$.

Substituting these relations as well as that for $B_{nj}$ from equations
(17) and (18), and remembering that $\epsilon_n = V_{nn}$, it follows that

$$C_{ni} = \frac{1}{E_n^0 - E_i^0} \left[ {\sum_j}'' \frac{V_{ij} V_{jn}}{E_n^0 - E_j^0} + \frac{(V_{ii} - V_{nn})\, V_{in}}{E_n^0 - E_i^0} \right], \tag{26}$$

where the two primes over the summation sign indicate that the two
values $j = i$ and $j = n$ are excluded. It may be shown that the requirement
of normalization for the function $u_n$ leads in this case (as in the
case of the coefficients $B_{nj}$) to the conclusion $C_{nn} = 0$.

9.4 Perturbation Terms in Relation to Matrix Elements. It is 
important for an understanding of the " language " used in discussing 
problems in quantum mechanics to consider the relationship to matrix 
elements of the various perturbation terms derived in the previous 
sections. 

A matrix is defined as an array of magnitudes in rows and columns.
It is merely a table in which certain elements are arranged according to
a definite order.

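A very simple type of matrix is obtained from a consideration of
three linear equations in $x$, $y$, $z$ of the form

$$
\begin{aligned}
a_1 x + b_1 y + c_1 z &= 0, \\
a_2 x + b_2 y + c_2 z &= 0, \\
a_3 x + b_3 y + c_3 z &= 0.
\end{aligned}
$$

These three equations will yield values of $x$, $y$, and $z$ other than the
trivial solution $x = y = z = 0$ only if the determinant

$$
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{vmatrix} = 0.
$$

The array of values

$$
\begin{pmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{pmatrix},
$$

indicated by round brackets, is known as a matrix.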

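More generally, if we have $n$ equations in the $n$ variables $x_1, x_2, \ldots x_n$,
these will be of the form

$$
\begin{aligned}
a_{11} x_1 + a_{12} x_2 + \cdots + a_{1n} x_n &= 0, \\
a_{21} x_1 + a_{22} x_2 + \cdots + a_{2n} x_n &= 0, \\
\cdots \qquad & \\
a_{n1} x_1 + a_{n2} x_2 + \cdots + a_{nn} x_n &= 0,
\end{aligned}
$$

and the corresponding matrix is

$$
\begin{pmatrix}
a_{11} & a_{12} & \cdots & a_{1n} \\
a_{21} & a_{22} & \cdots & a_{2n} \\
\vdots & \vdots & & \vdots \\
a_{n1} & a_{n2} & \cdots & a_{nn}
\end{pmatrix}.
$$

The element of the $m$th row and $k$th column is indicated as $a_{mk}$, and
it will be observed that the diagonal elements are designated by $a_{kk}$
or $a_{mm}$.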

Now let us consider the integral $V_{kn}$ defined in equation (19),
and arrange all the possible values of this integral as a matrix. Since
$k$ or $n$ can have any integral value from 1 to $\infty$, the matrix will consist
of a double infinitude of elements, which we can indicate thus:

$$
\begin{matrix}
V_{11} & V_{12} & \cdots & V_{1n} & \cdots \\
V_{21} & V_{22} & \cdots & V_{2n} & \cdots \\
\vdots & \vdots & & \vdots & \\
V_{k1} & V_{k2} & \cdots & V_{kn} & \cdots \\
\vdots & \vdots & & \vdots &
\end{matrix}
$$

(Since the determinant will be indicated by enclosing the array in
vertical lines, we may omit the brackets when considering the corresponding
matrix.)

Now it will be observed that the first-order perturbation energy terms
are given by the diagonal elements of this matrix. That is,

$$\epsilon_n = V_{nn}.$$

The coefficients $B_{nk}$ of the functions $u_k^0$ in the series expansion

$$\phi_n = {\sum_k}' B_{nk}\, u_k^0$$

for the perturbation term of the first order in the eigenfunction, are given,
as shown in equation (18), by the relation

$$B_{nk} = \frac{V_{kn}}{E_n^0 - E_k^0},$$

where $B_{nn} = 0$. These coefficients evidently correspond to all the
elements of the $n$th column with the exception of that element which
occurs on the diagonal.

The second-order perturbation energy is given by equation (23),
where each term $V_{nk} V_{kn}$ is obtained by multiplying the $k$th element
in the $n$th row by the $k$th element in the $n$th column. The term $(V_{nn})^2$ is
excluded from the summation. It will be observed that the method
used in deriving the expression for $\eta_n$ is similar to that used in the
multiplication of two determinants.

Since $V_1$ is a real function of the coordinates, and

$$V_{kn} = \int \bar{u}_k^0 V_1 u_n^0\, d\tau, \qquad V_{nk} = \int \bar{u}_n^0 V_1 u_k^0\, d\tau,$$

it follows that $V_{nk} V_{kn}$ is a real magnitude, and that $V_{nk}$ and $V_{kn}$
are complex conjugates. That is, $|V_{nk}| = |V_{kn}|$. A matrix having
the property that elements which are symmetrical with respect to the
diagonal form a complex conjugate pair is known as a matrix of the Hermitian
type. It will be found that all the matrices which occur in quantum
mechanics are of this type.
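
The statement that the products $V_{nk} V_{kn}$ are real for a matrix of the Hermitian type can be illustrated numerically. The sketch below is not taken from the text: it builds an arbitrary complex Hermitian matrix from random numbers (the names `A`, `V`, and `products` are chosen here for illustration) and forms every product of an element with its mirror image across the diagonal.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
V = (A + A.conj().T) / 2          # Hermitian by construction: V[n, k] == conj(V[k, n])

products = V * V.T                # element [n, k] is V_nk * V_kn
print(np.max(np.abs(products.imag)))
```

Each product equals $|V_{nk}|^2$, so its imaginary part vanishes to within rounding error.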

It will also be observed that each term $V_{ij} V_{jn}$ in the expression given
in equation (26) is real and the summation corresponds to the series
obtained by multiplying each term in the $i$th row by the corresponding
term in the $n$th column, omitting those terms which are on the diagonal
in each case.

9.5 Perturbation Theory for Degenerate Case. 3 In the previous
sections we considered the perturbation theory for the non-degenerate
case, that is, the case in which corresponding to each eigenvalue $E_n^0$
there exists only one eigenfunction $u_n^0$. In considering the problem of
the hydrogen atom it was shown that corresponding to any one eigenvalue
$E_n$, there exist $n^2$ eigenfunctions. This is therefore an illustration
of a degenerate state, in which the degeneracy may be removed
by the perturbing effect of magnetic or electrostatic fields.

3 The reader will find this section less difficult after he has studied the discussion
of the helium atom problem which will be given in the following chapter.

Let us consider the case in which, corresponding to the $n$th eigenvalue
$E_n^0$, there exist a number $\nu$ (which depends upon $n$) of eigenfunctions

$$u_{n1}^0,\; u_{n2}^0,\; \ldots,\; u_{n\nu}^0.$$

In the presence of a perturbing field, the single energy state of quantum
number $n$ splits up into $\nu$ states for which the energy values may be
designated by

$$E_{n\mu} = E_n^0 + \lambda \epsilon_{n\mu}. \tag{27}$$

Since each of the functions $u_{n\gamma}^0$ is a solution of the wave equation for
the unperturbed state, the wave function for each of the $\nu$ states obtained
by removing the degeneracy must reduce, as $\lambda$ tends to zero, to
some linear combination of the functions $u_{n\gamma}^0$. Since there are $\nu$ functions,
it will be possible to form from them $\nu$ linearly independent
functions of the form

$$u'_{n\mu} = \sum_\gamma C_{\mu\gamma}\, u_{n\gamma}^0, \tag{28}$$

where $\mu$ and $\gamma$ may each assume the values $1, 2, \ldots \nu$. The functions
$u'_{n\mu}$ are designated the zero-order wave functions, and $C_{\mu\gamma}$ is a coefficient,
the value of which varies for each term of the series.

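Hence, the first-order approximation to each of the eigenfunctions
for the $\nu$ states obtained by removing the degeneracy is given by

$$u_{n\mu} = u'_{n\mu} + \lambda \phi_{n\mu}, \tag{29}$$

where $\phi_{n\mu}$ plays a role similar to that of $\phi_n$ in equation (8).
The S. equation for the perturbed states is given by

$$\nabla^2 (u'_{n\mu} + \lambda \phi_{n\mu}) + \alpha^2 (E_n^0 + \lambda \epsilon_{n\mu} - V_0 - \lambda V_1)(u'_{n\mu} + \lambda \phi_{n\mu}) = 0.$$

It follows that

$$\nabla^2 u'_{n\mu} + \alpha^2 (E_n^0 - V_0)\, u'_{n\mu} = 0; \tag{30}$$

also that

$$\nabla^2 \phi_{n\mu} + \alpha^2 (E_n^0 - V_0)\, \phi_{n\mu} = \alpha^2 (V_1 - \epsilon_{n\mu})\, u'_{n\mu}. \tag{31}$$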

Evidently equation (30) is satisfied because of the validity of equation
(28). To determine $\epsilon_{n\mu}$ we proceed by assuming that $\phi_{n\mu}$ can also be
developed in terms of the unperturbed functions $u_{k\beta}^0$ in the form

$$\phi_{n\mu} = \sum_k \sum_\beta A_{k\beta}\, u_{k\beta}^0, \tag{32}$$

where the values of the coefficients $A_{k\beta}$ will depend upon the particular
values of $n$ and $\mu$ as well as on $k$ and $\beta$. The notation $u_{k\beta}^0$ refers to any
one of the set of functions which correspond to the same energy state
$E_k^0$.

Substituting from equation (32) in equation (31), the result is

$$\sum_k \sum_\beta A_{k\beta} (E_n^0 - E_k^0)\, u_{k\beta}^0 = (V_1 - \epsilon_n)\, u'_{n\mu},$$

where $\epsilon_n$ is written instead of $\epsilon_{n\mu}$.

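Multiplying both sides by $\bar{u}_{k\beta}^0$ and integrating over the configuration
space, it is seen that

$$A_{k\beta} (E_n^0 - E_k^0) = \int \bar{u}_{k\beta}^0 (V_1 - \epsilon_n)\, u'_{n\mu}\, d\tau.$$

For $k = n$, this relation becomes

$$0 = \int \bar{u}_{n\beta}^0 (V_1 - \epsilon_n)\, u'_{n\mu}\, d\tau.$$

By substituting for $u'_{n\mu}$ from equation (28), it follows that

$$
\begin{aligned}
0 &= \sum_\gamma C_{\mu\gamma} \left[ \int \bar{u}_{n\beta}^0 V_1 u_{n\gamma}^0\, d\tau - \epsilon_n \int \bar{u}_{n\beta}^0 u_{n\gamma}^0\, d\tau \right] \\
  &= \sum_\gamma C_{\mu\gamma} \left( V_{\beta\gamma} - \epsilon_n \delta_{\beta\gamma} \right);
\end{aligned} \tag{33}
$$

where

$$V_{\beta\gamma} = \int \bar{u}_{n\beta}^0 V_1 u_{n\gamma}^0\, d\tau, \tag{34}$$

and

$$\delta_{\beta\gamma} = \int \bar{u}_{n\beta}^0 u_{n\gamma}^0\, d\tau = \begin{cases} 1 & \text{for } \beta = \gamma, \\ 0 & \text{for } \beta \neq \gamma. \end{cases} \tag{35}$$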

For each value of $\beta$ there will exist a relation similar to equation (33),
and if there are $\nu$ eigenfunctions $u_{n\beta}^0$ (or $u_{n\gamma}^0$, $u_{n\mu}^0$, etc.) corresponding to
the same energy value $E_n^0$ in the degenerate case, there will be obtained
$\nu$ linear equations in the $\nu$ coefficients $C_{\mu\gamma}$, where $\gamma = 1, 2, \ldots \nu$.
Each of these equations will be of the form

$$\sum_\gamma C_{\mu\gamma} \left( V_{\beta\gamma} - \epsilon_n \delta_{\beta\gamma} \right) = 0.$$

The $\nu$ equations thus constitute a system for the determination of the
$\nu$ constants $C_{\mu 1}$, $C_{\mu 2}$, etc. The condition that such a system has a
solution (other than the trivial one that each coefficient vanish identically)
is that the determinant of the coefficients of these constants
should vanish. That is, the condition is of the form

$$
\begin{vmatrix}
V_{11} - \epsilon_n & V_{12} & V_{13} & \cdots & V_{1\nu} \\
V_{21} & V_{22} - \epsilon_n & V_{23} & \cdots & V_{2\nu} \\
\vdots & \vdots & \vdots & & \vdots \\
V_{\nu 1} & V_{\nu 2} & V_{\nu 3} & \cdots & V_{\nu\nu} - \epsilon_n
\end{vmatrix} = 0, \tag{36}
$$

where it will be observed that owing to the validity of equation (35)
the only coefficients of $\epsilon_n$ in equation (33) which are not identically
equal to zero are those which are situated on the diagonal of the determinant
in (36).

The order of this determinant is evidently the same as the order of
degeneracy of the state for which the energy is $E_n^0$. Consequently, the
equation for $\epsilon_n$ will be of the form

$$a_\nu \epsilon_n^\nu + a_{\nu-1} \epsilon_n^{\nu-1} + \cdots + a_0 = 0. \tag{37}$$

That is, it will be an algebraic equation of degree $\nu$, and will possess $\nu$
roots.

Equation (36) or (37) is known as the secular equation. That the
roots are real is evident from the consideration that in the case of real
functions $V_{\beta\gamma} = V_{\gamma\beta}$, while in the case of complex conjugate functions
it may be demonstrated as in the case of the elements in equation (18)
that $V_{\beta\gamma}$ and $V_{\gamma\beta}$ constitute a complex conjugate pair.

An illustration of the application of the secular equation to the case 
v = 2 will be given in the following chapter in the discussion of the 
helium atom problem. 

Having determined the $\nu$ different values of $\epsilon_n$, it becomes possible
to obtain for each of these values a solution of the set of equations (33)
for the ratios of the $C$'s. In other words, for each value $\epsilon_{n\mu}$ a set of
values of the ratios of the coefficients $C_{\mu 2}/C_{\mu 1}$; $C_{\mu 3}/C_{\mu 1}$; $\ldots$ $C_{\mu\nu}/C_{\mu 1}$
is obtained, from which the expansion indicated in equation (28) for
$u'_{n\mu}$ may be written down.
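
The secular equation can be illustrated on a small matrix model. The sketch below rests on assumptions that are not in the text: a hypothetical system of three levels, of which the first two are degenerate, and a real symmetric matrix `V` of matrix elements. The roots of the secular equation are obtained as the eigenvalues of the 2 x 2 block of `V` belonging to the degenerate level, and the columns of `C` give the corresponding coefficients of the zero-order functions (up to normalization).

```python
import numpy as np

# Hypothetical model: levels 0 and 1 are degenerate (E0 = 1), level 2 is separate.
E0 = np.array([1.0, 1.0, 3.0])
V = np.array([[0.2, 0.5, 0.1],
              [0.5, -0.4, 0.3],
              [0.1, 0.3, 0.6]])
lam = 0.001

block = V[:2, :2]                   # matrix elements within the degenerate level
eps, C = np.linalg.eigh(block)      # roots of the secular equation, coefficient vectors
first_order = E0[0] + lam * eps     # split energies to first order

exact = np.linalg.eigvalsh(np.diag(E0) + lam * V)[:2]
char_poly = np.poly(block)          # coefficients of the algebraic equation of degree 2
print(first_order, exact, char_poly)
```

The two roots differ, so the degeneracy is removed in the first order; the exact eigenvalues of the model agree with `first_order` up to terms in $\lambda^2$.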

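The procedure for the determination of the coefficients $A_{k\beta}$ in (32)
is similar to that used in the non-degenerate case, but more tedious.
The relation deduced for $u_{n\mu}$ is of the form

$$u_{n\mu} = \sum_\gamma C_{\mu\gamma} \left[ u_{n\gamma}^0 + \lambda \sum_k \sum_\beta \frac{\int \bar{u}_{k\beta}^0 V_1 u_{n\gamma}^0\, d\tau}{E_n^0 - E_k^0}\, u_{k\beta}^0 \right], \tag{38}$$

where the summation inside the brackets extends over all states for
which $k \neq n$.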

9.6 Application to Coupled Linear Oscillators. As mentioned at the
beginning of this chapter, the perturbation theory is essentially a
method of transformation by means of which the solution for the perturbed
energy state is expressed in terms of the known solution for
the unperturbed state and of the expression for the perturbing potential
energy function.

It is therefore of interest to consider the solution by the perturbation
theory method of a problem for which the solution may also be derived
by means of a transformation of coordinate variables. Such a problem
is that of the coupled linear harmonic oscillators considered in the
previous chapter. The S. equation for the system, as shown in that
connection, is of the form

$$\frac{\partial^2 \phi}{\partial q_1^2} + \frac{\partial^2 \phi}{\partial q_2^2} + \alpha^2 \left( E - \frac{k q_1^2}{2} - \frac{k q_2^2}{2} + \frac{2 e^2 q_1 q_2}{R^3} \right) \phi = 0, \tag{39}$$

where $\alpha^2 = 8\pi^2\mu/h^2$ and $q_1$ and $q_2$ refer to the coordinates of each particle,
measured along the same coordinate axis.

It was shown that this equation could be transformed by a change of
the variables to the so-called "normal" form, so that the resulting
characteristic function could be expressed as the product of two functions
(each a function of only one variable). In this case the perturbation
term in the expression for the potential energy function is given by
$-2e^2 q_1 q_2/R^3$, and it was found that this leads to a perturbation term in
the expression for the total energy of the form

$$\Delta E = -\frac{e^4 h \nu_0}{2 k^2 R^6},$$

which corresponds to an attractive force between the oscillating particles.

Let us now consider the same problem from the point of view of the
perturbation theory.

Using the same symbols as in the problem of the linear harmonic
oscillator (see Chapter V), the frequency of each identical oscillator in
the unperturbed state is given by

$$\nu_0 = \frac{1}{2\pi} \sqrt{\frac{k}{\mu}}.$$

We shall replace $q_1$ and $q_2$ in equation (39) by the new variables $x_1$
and $x_2$, respectively, defined by the relation

$$x_1 = q_1 \sqrt{b}, \qquad x_2 = q_2 \sqrt{b}, \qquad b = \frac{4\pi^2 \mu \nu_0}{h}. \tag{40}$$

Consequently, equation (39) becomes

$$\frac{\partial^2 \phi}{\partial x_1^2} + \frac{\partial^2 \phi}{\partial x_2^2} + \left( \frac{a}{b} - x_1^2 - x_2^2 + \lambda x_1 x_2 \right) \phi = 0, \tag{41}$$

where $a = \alpha^2 E$, and

$$\lambda = \frac{4 e^2}{k R^3}. \tag{42}$$

Thus the perturbation potential energy term is $-\lambda x_1 x_2$, where $\lambda$ is a
parameter, the value of which vanishes for the unperturbed state
($R = \infty$).

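The characteristic function for the unperturbed state is

$$u_{nm}^0 = \phi_n(x_1)\, \phi_m(x_2), \tag{43}$$

and the energy of the system for the quantum states $n$ and $m$ of each
oscillator is

$$E_{nm} = (n + m + 1)\, h\nu_0. \tag{44}$$

As shown in Chapter V, the function $\phi_n(x)$ is real and has the form

$$\phi_n(x) = N_n\, e^{-x^2/2} H_n(x),$$

where $N_n$ is the normalization factor and $H_n(x)$ is the Hermitian
polynomial of the $n$th degree.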

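Applying equation (16a) for the first-order perturbation energy,
which will be designated by $\Delta E^{(1)}$, it follows that

$$\Delta E^{(1)} = -\lambda \iint \phi_n^2(x_1)\, \phi_m^2(x_2)\, x_1 x_2\, dx_1\, dx_2,$$

where the limits of integration are $\pm\infty$ for each variable. Hence,

$$\Delta E^{(1)} = -\lambda \int \phi_n^2(x_1)\, x_1\, dx_1 \int \phi_m^2(x_2)\, x_2\, dx_2. \tag{45}$$

But as was shown in Chapter V, the integrals on the right-hand side
of (45) correspond to the average values of $x_1$ and $x_2$ respectively, and
each of these vanishes. Hence,

$$\Delta E^{(1)} = 0.$$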
Let us now consider the second-order perturbation energy. According
to equation (23) this is given by the relation

$$\Delta E^{(2)} = \lambda^2 {\sum_{n'm'}}' \frac{V_{nm,n'm'}\, V_{n'm',nm}}{E_{nm} - E_{n'm'}}, \tag{46}$$

where

$$V_{nm,n'm'} = -\iint \phi_n(x_1)\, \phi_m(x_2)\, x_1 x_2\, \phi_{n'}(x_1)\, \phi_{m'}(x_2)\, dx_1\, dx_2,$$

and $n'$ and $m'$ correspond to the values of the quantum numbers $n$ and $m$
for any other state of the system. Thus the total energy of the oscillators
in the unperturbed state $n'm'$ is given by

$$E_{n'm'} = (n' + m' + 1)\, h\nu_0.$$

Evidently,

$$V_{nm,n'm'} = -\int \phi_n(x_1)\, \phi_{n'}(x_1)\, x_1\, dx_1 \int \phi_m(x_2)\, \phi_{m'}(x_2)\, x_2\, dx_2,$$

and the summation in (46) is extended over all possible values of $n'$
and $m'$, excluding the state $n' = n$, $m' = m$.

Now for the oscillators under consideration, $n = m = 0$. Also, as
shown in Chapter V [see equation (5.36) and subsequent discussion],
the integral

$$x_{nn'} = \int \phi_n(x)\, x\, \phi_{n'}(x)\, dx$$

vanishes unless $n' - n = \pm 1$. Since $n = 0$, the only possible value of
$n'$ is 1, and similarly for $m'$. Furthermore, since

$$x_{01} = \int \phi_0(x)\, x\, \phi_1(x)\, dx = \frac{1}{\sqrt{2}},$$

it follows that

$$V_{00,11} = -\tfrac{1}{2}.$$

This constitutes the only term in the summation indicated in equation
(46). To determine the corresponding value of the denominator
$E_{00} - E_{11}$, it must be observed that in equation (41) both $a/b$ and $x$ are
dimensionless, and hence in equation (46) $\Delta E^{(2)}$ is expressed as a multiple
of a unit of energy. Since $a/b = 2E/h\nu_0$, the unit of energy is $h\nu_0/2$.
In terms of these units,

$$E_{00} - E_{11} = 2 - 6 = -4.$$

Consequently,

$$\Delta E^{(2)} = -\frac{\lambda^2}{16} \cdot \frac{h\nu_0}{2} = -\frac{e^4 h \nu_0}{2 k^2 R^6},$$

which is identical with the result obtained in equation (5.20).
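
The agreement between the perturbation result and the result of the coordinate transformation can be checked numerically. The sketch below makes its assumptions explicit, since they are our reading of this section rather than formulas quoted from it: the perturbing term is taken as $-\lambda x_1 x_2$ in the dimensionless variables, energies are measured in units of $h\nu_0/2$ (so that the unperturbed states (0,0) and (1,1) have energies 2 and 6), and the exact ground-state energy is taken from the normal-mode frequencies $\nu_0 \sqrt{1 \mp \lambda/2}$. The names `x01`, `second_order_energy`, and `exact_energy` are chosen for illustration.

```python
import numpy as np
from numpy.polynomial.hermite import hermgauss

# Matrix element of x between the normalized oscillator functions phi_0 and phi_1.
# phi_0 * x * phi_1 = sqrt(2/pi) * x**2 * exp(-x**2); Gauss-Hermite quadrature
# supplies the weight exp(-x**2).
nodes, weights = hermgauss(20)
x01 = np.sum(weights * np.sqrt(2.0 / np.pi) * nodes**2)

def second_order_energy(lam):
    """Ground-state energy through second order, in units of h*nu0/2."""
    v = -x01**2              # matrix element between states (0,0) and (1,1)
    e00, e11 = 2.0, 6.0      # 2 * (n + m + 1) in units of h*nu0/2
    return e00 + lam**2 * v**2 / (e00 - e11)

def exact_energy(lam):
    """Zero-point energy of the two normal modes, in units of h*nu0/2."""
    return np.sqrt(1.0 - lam / 2.0) + np.sqrt(1.0 + lam / 2.0)

lam = 0.1
print(x01, second_order_energy(lam), exact_energy(lam))
```

For small $\lambda$ the two energies differ only by terms of the fourth order in $\lambda$.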

In order to obtain the corresponding perturbation terms in the
eigenfunction for the system a much more tedious calculation is necessary,
and hence the method used in Chapter VIII is much more convenient
in that respect. It should, however, be pointed out that, though
$\Delta E^{(1)}$ vanishes, it does not necessarily follow that the corresponding
first-order perturbation term in the eigenfunction also vanishes. Thus,
if we apply equation (18) and equation (5.36) we obtain the result,

$$u_{00} - u_{00}^0 = \lambda\, \frac{V_{11,00}}{E_{00} - E_{11}}\, u_{11}^0 = \frac{\lambda}{8}\, \phi_1(x_1)\, \phi_1(x_2) = \frac{\lambda x_1 x_2}{4}\, u_{00}^0,$$

since $H_1(x) = 2x$, so that $\phi_1(x) = N_1 \cdot 2x\, e^{-x^2/2} = \sqrt{2}\, x\, \phi_0(x)$.

Hence, to a first approximation, the eigenfunction for the system
will be given by

$$u_{00} = u_{00}^0 \left( 1 + \frac{\lambda x_1 x_2}{4} \right),$$

which shows that $u_{00}$ will be greater or less than $u_{00}^0$ according as
$x_1 x_2$ is positive or negative.

9.7 Interaction Energy of Two Hydrogen Atoms. In Chapter VIII
the problem of the interaction of two hydrogen atoms, at such a distance
that there is no interchange of electrons, was treated as a problem
of three pairs of linear harmonic oscillators, each pair acting along one
of the three orthogonal coordinate axes. This is the method used by
F. London 4 and also by J. E. Lennard-Jones. 5

On the other hand, R. Eisenschitz and F. London 6 have applied 
the second-order perturbation theory, using for this purpose the wave 
functions for the hydrogen atom. The following remarks give a sum- 
mary of their extremely comprehensive paper. 

At relatively small distances of separation two hydrogen atoms may 
react to form a molecule. The large binding energy arises from the 
fact that the two electrons, originally associated with separate nuclei, 
constantly interchange places. This problem was first treated by 
London and Heitler and is discussed in a subsequent chapter. As the 
distance of separation between the two atoms increases, the frequency 
of such interchanges decreases and at sufficiently large distances be- 
comes negligibly low. Under these conditions, however, there exists 
an attractive energy which is due to dipole-dipole interaction and which, 
as shown in the previous chapter, accounts for van der Waals forces. 
The perturbing potential energy term for this interaction is given by the
relation

$$V_p = \frac{e^2}{R^3} \left( x_1 x_2 + y_1 y_2 - 2 z_1 z_2 \right),$$

in which the $z$-axis is taken along $R$.
In terms of $r_1$, $r_2$, and $R$ (see Fig. 50) this may be expressed in the
form 7

$$V_p = \frac{e^2 r_1 r_2}{R^3} \left\{ \cos(r_1, r_2) - 3 \cos(r_1, R) \cos(r_2, R) \right\}, \tag{49}$$

where $\cos(r_1, r_2)$ refers to the cosine of the angle between $r_1$ and $r_2$,
and similarly for $\cos(r_1, R)$ and $\cos(r_2, R)$.

Let $\phi_1(1)$ denote the eigenfunction for the hydrogen atom in which
electron 1 occurs and $\phi_1(2)$ the eigenfunction associated with electron
2. Then, the first-order perturbation energy is given in accordance
with equation (16b) by the relation

$$\Delta E^{(1)} = \iint V_p\, \phi_1^2(1)\, \phi_1^2(2)\, d\tau_1\, d\tau_2.$$

It may be shown that since the eigenfunctions are spherically symmetrical,
the integral in this equation vanishes, so that $\Delta E^{(1)} = 0$.

4 Z. physik. Chem., B11, 222 (1931).

5 Proc. Phys. Soc. (London), 43, 461 (1931).

6 Z. Physik, 60, 491 (1930).

7 See Jeans' "Treatise on Electricity and Magnetism" for the derivation of this
relation.

Let $E_n^0$ designate the energy of the hydrogen atom in the state of total
quantum number $n$. Then, in accordance with equation (23), the second-order
perturbation energy is given by the relation

$$\Delta E^{(2)} = {\sum_{n'n''}}' \frac{V_{11,n'n''}\, V_{n'n'',11}}{2E_1^0 - E_{n'}^0 - E_{n''}^0}. \tag{50}$$

In this equation, $n'$ and $n''$ refer to excited states from which transitions
are permitted to the normal state $n = 1$, and the prime above the
summation indicates that we must omit the case $n' = n'' = 1$. These
states must therefore be of the nature of p-states ($l = 1$), for which the
eigenfunctions are not spherically symmetrical.

The matrix elements are defined by the relation

$$V_{11,n'n''} = \iint \phi_1(1)\, \phi_1(2)\, V_p\, \phi_{n'}(1)\, \phi_{n''}(2)\, d\tau_1\, d\tau_2, \tag{51}$$

in which, in order to simplify the argument, it is assumed that the
eigenfunctions are real.

The evaluation of the individual terms in equation (50) is evidently
quite involved. By adopting a simple approximation to the series,
L. Pauling and E. B. Wilson, Jr. 8 have derived the value

$$\Delta E^{(2)} = -\frac{6 e^2 a_0^5}{R^6}, \tag{52}$$

while Eisenschitz and London, by carrying out the more laborious computation,
obtained a value for $\Delta E^{(2)}$ in which the factor 6 is replaced by
6.47. These conclusions are to be compared with those given in the
previous chapter regarding the value of $C$ in equation (8.41).

It is of interest to point out that the physical interpretation of equation
(50) is essentially the same as that of equation (46).
Combining equation (49) with equation (51), it is seen that the
matrix element $V_{11,n'n''}$ involves the product of two integrals which are
as follows:

$$M_{n'} = e \int \phi_1\, r\, \phi_{n'}\, d\tau \tag{53}$$

and

$$M_{n''} = e \int \phi_1\, r\, \phi_{n''}\, d\tau.$$

These evidently correspond to dipole moments. Now, as is discussed
more fully in Chapter XV, the magnitudes of these moments are
related to the intensities of the lines corresponding to the transitions
$n' \to 1$ and $n'' \to 1$, respectively. Thus, the value of the second-order
perturbation energy may be derived from observations on spectral
intensities. It is also evident from these considerations and those stated
in the previous chapter that the polarizability $\alpha$ may be calculated from
the values of the energy levels and the corresponding eigenfunctions.
It is because of the existence of this connection between $\alpha$ and the
van der Waals forces that the latter have been designated "dispersion
forces." (See footnote at the end of the previous chapter.) The discussion
in the following section illustrates even more directly this point
of view of the van der Waals forces.

8 "Quantum Mechanics," pp. 384-385.

9.8 Application of Perturbation Theory to Stark and Zeeman Effects. 
In presence of magnetic fields (Zeeman effect) or strong electric fields 
(Stark effect) spectral lines are resolved into components. 9 Since 
any spectral line is due to a transition between two energy states, the 
observed effects must be due to the perturbing effect of the field on the 
motion of the electron about the nucleus, in each of the two energy 
states. 

A simple illustration of the Stark effect is the problem discussed in 
Chapter VIII and in the previous section, which dealt with the effect 
on the energy states and eigenfunctions of a linear harmonic oscillator 
of the field due to an identical oscillator at a distance. 

In the case of the hydrogen atom, the S. equation for the perturbed
state in presence of an electric field of strength $F$ applied in the direction
of the $z$-axis is of the form

$$\nabla^2 \phi + \frac{8\pi^2\mu}{h^2} \left( E + \frac{e^2}{r} - eFz \right) \phi = 0, \tag{54}$$

where $\lambda V_1 = eFz$ is the perturbing potential energy function.

(In discussing, in Section 8.4, the effect of an electric field on the 
linear harmonic oscillator, the negative sign was used for the perturbing 
potential energy function because the electric charge was regarded as a 
positive magnitude. However, since a field directed in the positive 
direction of z exerts a force on the electron in the negative direction of z, 
the positive sign must be used in the present case. It should also be 
noted that with this convention in signs, the value to be used for the 
electron charge e should be regarded as a positive magnitude.) 

According to equation (16a), the first-order perturbation energy is
given by

$$\Delta E^{(1)} = eF \int \bar{\phi}_{nlm}\, z\, \phi_{nlm}\, d\tau. \tag{55}$$

Let us consider the case $n = 1$, $l = m = 0$, that is, the normal state
of the atom. Then $\phi_{100}$ is a function of $r$ alone, and if $\phi_{100}^2$ is multiplied by
$z$ and integrated over the whole of the configuration space the integral
must vanish. Consequently, $\Delta E^{(1)} = 0$ for the normal state of the
hydrogen atom.

9 For description of these observations see F. K. Richtmyer's "Introduction to
Modern Physics," also H. S. Taylor's "Treatise on Physical Chemistry," Chapter
XVI.
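
The vanishing of this integral can be made explicit. The step below is a sketch in polar coordinates, with $z = r\cos\theta$ and $d\tau = r^2 \sin\theta\, dr\, d\theta\, d\varphi$; the symbol $\varphi$ for the azimuthal angle is our choice and is not taken from the text.

```latex
\int \phi_{100}^2\, z\, d\tau
  = \int_0^\infty \phi_{100}^2(r)\, r^3\, dr
    \int_0^\pi \cos\theta \sin\theta\, d\theta
    \int_0^{2\pi} d\varphi ,
\qquad
\int_0^\pi \cos\theta \sin\theta\, d\theta
  = \left[ \tfrac{1}{2} \sin^2\theta \right]_0^\pi = 0 .
```

The radial and azimuthal factors are finite, so the whole integral vanishes because of the factor in $\theta$ alone; the same argument applies to any eigenfunction that depends on $r$ only.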

In the case of the second-order perturbation term, equation (23)
shows that

$$\Delta E^{(2)} = e^2 F^2 {\sum_k}' \frac{V_{1k}\, V_{k1}}{E_1^0 - E_k^0}, \tag{56}$$

where

$$V_{1k} = \int \bar{\phi}_{100}\, z\, \phi_{klm}\, d\tau. \tag{57}$$

If $l = m = 0$, then the integral vanishes for the same reason as in
the case $n = k = 1$. But if either or both of the eigenfunctions are not
spherically symmetrical then $V_{1k}$ has a definite value which may be
calculated from the expressions for the eigenfunctions, and from the
values of $E_k^0$.

An interesting application of equation (56) is in the calculation of the
polarizability. 10 As pointed out in the previous chapter, the increase
in energy of an atom in a uniform electric field of strength $F$ is given by

$$E_p = -\frac{\alpha F^2}{2}. \tag{8.26}$$

But $E_p$, the polarization energy, must be identical with $\Delta E^{(2)}$, the
second-order Stark effect. Hence,

$$\alpha = -2 e^2 {\sum_k}' \frac{V_{1k}\, V_{k1}}{E_1^0 - E_k^0}. \tag{58}$$

That is, the polarizability can be calculated, as mentioned in the
previous section, from the values of the energy levels and corresponding
eigenfunctions.

As shown by E. Schroedinger 11 and subsequent investigators, the 
relations deduced from equation (54) are in good agreement with the 
actual observations on the splitting up of the hydrogen lines in an 
electric field. 

In the case of the Zeeman effect the first-order perturbation energy
term is given by the same relation as was derived in the older theory,
that is,

$$\Delta E^{(1)} = m\, \frac{e h}{4\pi \mu c}\, H,$$

where $m$ may have any one of the integral values ranging between
$-l$ and $+l$, and $H$ is the strength of the magnetic field.

10 See references in collateral reading.

11 Ann. Physik, 80, 437 (1926). See also Condon and Morse, "Quantum Mechanics,"
pp. 123-129.

COLLATERAL READING 

1. Illustrations of the applications of the perturbation theory are given in (1)
PAULING and WILSON, "Quantum Mechanics," Chapter VI, and (2) CONDON and
MORSE, "Quantum Mechanics," p. 116 et seq.

2. The method used for deriving equation (58) for the polarizability is that given 
by LINDSAY and MARGENAU, "Foundations of Physics," pp. 460-469. This section 
also contains a discussion of the application to the Zeeman effect. 

3. The reader will also find it of interest to consult the following references on the 
perturbation theory: 

(1) KEMBLE, E. C., Phys. Rev. Suppl., 1, 157 (1929).

(2) SLATER and FRANK, "Theoretical Physics," Chapter 32, which develops 

the relations by use of the secular equation. 

4. EISENSCHITZ, R., and LONDON, F., Z. Physik, 60, 491 (1930). The first part
of this comprehensive paper is quite difficult to follow, but the reader will find it 
instructive to study more fully the second part which deals specifically with the 
problem of the interaction of two hydrogen atoms. This paper is also of interest in 
connection with the Heitler-London treatment of the problem of the H2 molecule 
which is discussed in a subsequent chapter. 



CHAPTER X 
THE HELIUM ATOM PERTURBATION METHOD 

10.1 Atomic Units. In dealing with the problem of the helium atom 
and atomic systems in general, it is convenient to use a system of so- 
called atomic units, which were introduced by D. R. Hartree 1 and 
have been adopted quite generally by writers on quantum mechanics. 2 

These units are defined as follows:

Unit of length, the radius of the Bohr orbit in the normal state ($n = 1$)
of the hydrogen atom, which is designated by $a_0$, where

$$a_0 = \frac{h^2}{4\pi^2 \mu e^2};$$

Unit of charge $e$, the charge on the electron;
Unit of mass $\mu$, the mass of the electron.

Consistent with these are the following:

Unit of action $h/(2\pi)$, which is usually designated by $\hbar$;
Unit of energy $e^2/a_0$, which is twice the ionization energy of the hydrogen
atom with fixed nucleus, that is, 2 × 13.53 e.v.
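
The magnitudes of these units can be evaluated numerically. The sketch below is an illustration only and departs from the text in two stated respects: it uses SI units, in which the unit of length takes the form $a_0 = 4\pi\varepsilon_0 \hbar^2/(\mu e^2)$ and the unit of energy the form $e^2/(4\pi\varepsilon_0 a_0)$, and it uses present-day (CODATA 2018) values of the constants rather than those available when the text was written. The variable names are ours.

```python
import math

# CODATA 2018 values in SI units (assumed here; the text itself works in Gaussian units).
h = 6.62607015e-34          # Planck constant, J s
e = 1.602176634e-19         # elementary charge, C
m_e = 9.1093837015e-31      # electron mass, kg
eps0 = 8.8541878128e-12     # vacuum permittivity, F/m

hbar = h / (2.0 * math.pi)                           # unit of action
a0 = 4.0 * math.pi * eps0 * hbar**2 / (m_e * e**2)   # unit of length, m
hartree_J = e**2 / (4.0 * math.pi * eps0 * a0)       # unit of energy, J
hartree_eV = hartree_J / e                           # unit of energy, e.v.

print(a0, hartree_eV)
```

With these constants the unit of energy comes out somewhat larger than the 2 × 13.53 e.v. quoted above, a difference that presumably reflects the values of the constants accepted at the time.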

Hence in terms of these units

$$r = \sigma a_0, \tag{1}$$

and

$$E = \lambda\, \frac{e^2}{a_0} = \lambda\, \frac{4\pi^2 \mu e^4}{h^2} = \lambda \cdot 2Rch, \tag{2}$$

where $\sigma$ and $\lambda$ are dimensionless numbers, and $R$ is the Rydberg constant
for infinite mass, that is $R_\infty$.

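Let us now consider the radial equation for the hydrogen-like atom
which, as shown in equation (7.8), has the form

$$\frac{d^2 S}{dr^2} + \frac{2}{r} \frac{dS}{dr} + \left[ \frac{8\pi^2 \mu E}{h^2} + \frac{8\pi^2 \mu Z e^2}{h^2 r} - \frac{l(l+1)}{r^2} \right] S = 0. \tag{3}$$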

1 Proc. Cambridge Phil. Soc., 24, 89 (1928). An interesting discussion of this topic 
is given by E. U. Condon and G. H. Shortley in the Appendix of their treatise, " The 
Theory of Atomic Spectra." 

2 H. Bethe uses these units in his article in " Handbuch der Physik," Vol. XXIV, 
Parti. 
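Evidently,

$$\frac{d}{dr} = \frac{1}{a_0}\frac{d}{d\sigma}; \qquad \frac{d^2}{dr^2} = \frac{1}{a_0^2}\frac{d^2}{d\sigma^2};$$

$$\frac{8\pi^2\mu E}{h^2} = \lambda\,\frac{32\pi^4\mu^2 e^4}{h^4} = \frac{2\lambda}{a_0^2},$$

and

$$\frac{8\pi^2\mu Ze^2}{h^2 r} = \frac{2Z}{a_0^2\,\sigma}.$$

Hence, equation (3) assumes the form

$$\frac{d^2S}{d\sigma^2} + \frac{2}{\sigma}\frac{dS}{d\sigma} + \left[2\lambda + \frac{2Z}{\sigma} - \frac{l(l+1)}{\sigma^2}\right]S = 0. \tag{4}$$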
This has the solution [see equation (7.32)] 




where

$$\rho_0 = \frac{2Z\sigma}{n}.$$

That is,

$$\rho_0 = \frac{2Zr}{na_0}, \tag{7}$$

as in Chapter VII. (The symbol $\rho_0$ is used to distinguish this from the symbol $\rho$ used subsequently.)

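The eigenvalues corresponding to the eigenfunctions $S_{nl}(\sigma)$ are given by the relation

$$\lambda_n = -\frac{Z^2}{2n^2}. \tag{8}$$

That is,

$$E_n = -\frac{4\pi^2\mu e^4}{h^2}\cdot\frac{Z^2}{2n^2} = -Rch\,\frac{Z^2}{n^2}.$$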






The complete eigenfunction for the system is given, as shown in
Chapter VII, by the function



, 
which is the solution of the S. equation 
For the state n = 1, l = 0, it follows from equation (7.35) that the normalized eigenfunction is given by

$$\phi_{100} = \frac{Z^{3/2}}{\sqrt{\pi}}\,e^{-Z\sigma}, \tag{9}$$

and the eigenvalue by

$$\lambda_1 = -\frac{Z^2}{2},$$

that is, by

$$E_1 = -RchZ^2.$$

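10.2 The S. Equation for the Helium Atom. For the helium atom, the S. equation in terms of ordinary units [see equation (9.1)] is

$$\nabla_1^2\phi + \nabla_2^2\phi + \frac{8\pi^2\mu}{h^2}\left(E + \frac{Ze^2}{r_1} + \frac{Ze^2}{r_2} - \frac{e^2}{r_{12}}\right)\phi = 0, \tag{10a}$$

where $r_1$ and $r_2$ are the distances of the electrons from the nucleus, and $r_{12}$ denotes the interelectronic distance. In terms of Hartree units this equation becomes

$$\nabla_1^2\phi + \nabla_2^2\phi + 2\left(\lambda + \frac{Z}{\sigma_1} + \frac{Z}{\sigma_2} - \frac{1}{\sigma_{12}}\right)\phi = 0, \tag{10b}$$

which may also be written in the form

$$\nabla_1^2\phi + \nabla_2^2\phi + 2Z\left(\frac{\lambda}{Z} + \frac{1}{\sigma_1} + \frac{1}{\sigma_2} - \frac{1}{Z\sigma_{12}}\right)\phi = 0. \tag{10c}$$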

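The last equation shows that, for $Z \to \infty$, the perturbation term vanishes, and the solution is then given by the "zero-order" eigenfunction

$$\phi^0 = \phi_1(\sigma_1)\cdot\phi_2(\sigma_2), \tag{11}$$

where $\sigma_1$ is used as an abbreviation for the three coordinates of the first electron, and similarly for $\sigma_2$. The corresponding eigenvalues are evidently given by the relation

$$\lambda^0 = -\left(\frac{Z^2}{2n_1^2} + \frac{Z^2}{2n_2^2}\right), \tag{12}$$

which follows from equation (8).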

Evidently the S. equation (10b) or (10c) is unaltered if the coordinates of the two electrons are interchanged. That is, the S. equation is satisfied by either $\phi_1(\sigma_1)\cdot\phi_2(\sigma_2) \equiv u_{12}$ or $\phi_1(\sigma_2)\cdot\phi_2(\sigma_1) \equiv u_{21}$. Consequently there must exist a relation of the form

$$u_{21} = c\cdot u_{12},$$

where $c$ is a constant which when multiplied by $u_{12}$ converts it into $u_{21}$. But if we multiply the function $u_{21}$ by $c$, we must obtain the function $u_{12}$. Hence,

$$u_{12} = c^2\cdot u_{12}; \qquad c = \pm 1,$$

and

$$u_{21} = \pm u_{12}. \tag{13}$$

The zero-order energy corresponding to each of these two functions is the same, as is evident from equation (12). Thus, we have here a twofold degeneracy, owing to the fact that the two states defined by $u_{21}$ and $u_{12}$ are indistinguishable when there is no interaction between the electrons. Equation (13) shows that, for an atom with two electrons which are in different quantum states, the eigenfunctions may behave in one of two ways for an interchange of coordinates of the electrons. The functions either remain unaltered or they change sign. The former are designated symmetrical, and the latter antisymmetrical, eigenfunctions. In the case of helium, the symmetrical functions correspond to the para-states, and the antisymmetrical to the ortho-states, as will be shown in a subsequent section.

Only in the normal state ($n_1 = n_2 = 1$, $l_1 = l_2 = 0$) is this degeneracy absent. We shall therefore consider first the solution of the S. equation for this case.

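10.3 The Normal State of the Helium Atom. The zero-order eigenfunction for this state is

$$\phi^0 = \phi_1(\sigma_1)\,\phi_1(\sigma_2) = \frac{Z^3}{\pi}\,e^{-Z(\sigma_1+\sigma_2)}, \tag{14}$$

and the corresponding zero-order energy is evidently

$$\lambda^0 = -Z^2.$$

That is, in ordinary units

$$E^0 = -\frac{4\pi^2\mu e^4}{h^2}\,Z^2 = -2RchZ^2. \tag{15}$$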

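Applying the first-order perturbation theory to equation (10b), it follows from equation (9.16) that the perturbation energy is given by

$$\Delta\lambda^{(1)} = \int\!\!\int \frac{\phi_1^2(\sigma_1)\,\phi_1^2(\sigma_2)\,4\pi\sigma_1^2\,d\sigma_1\cdot 4\pi\sigma_2^2\,d\sigma_2}{\sigma_{12}}, \tag{16}$$

where $\phi(\sigma)$ has the form indicated in equation (9).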
Let $Z\sigma \equiv \rho$, then the normalized functions are of the form

$$\phi(\rho) = \frac{1}{\sqrt{\pi}}\,e^{-\rho},$$

and equation (16) assumes the form

$$\Delta\lambda^{(1)} = Z\int\!\!\int \frac{\phi_1^2(\rho_1)\,4\pi\rho_1^2\,d\rho_1\cdot\phi_1^2(\rho_2)\,4\pi\rho_2^2\,d\rho_2}{\rho_{12}}. \tag{17}$$

The integral in this equation represents, physically, the energy of interaction of two charge distributions. Though it could be calculated by expressing $\rho_{12}$ as a function of $\rho_1$, $\rho_2$, and $\theta$ (the angle between the two radii) by the relation

$$\rho_{12}^2 = \rho_1^2 + \rho_2^2 - 2\rho_1\rho_2\cos\theta$$

and then eliminating $\theta$ by an integration with respect to this variable, a much simpler method is available, which involves the application of potential theory. 3

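Let us consider a spherically symmetrical charge distribution described by the function $f(\rho)$. The total charge contained in a spherical shell of thickness $d\rho_s$ at $\rho = \rho_s$ is, therefore, $f(\rho_s)\,4\pi\rho_s^2\,d\rho_s$, and the potential of this shell at any point $\rho$ is given by

$$\frac{1}{\rho}\,f(\rho_s)\,4\pi\rho_s^2\,d\rho_s \quad \text{for } \rho > \rho_s,$$

and by

$$\frac{1}{\rho_s}\,f(\rho_s)\,4\pi\rho_s^2\,d\rho_s \quad \text{for } \rho < \rho_s.$$

Hence, the potential at a point $\rho$, due to the total charge distribution represented by

$$f(\rho_1) = \phi^2(\rho_1) = \frac{1}{\pi}\,e^{-2\rho_1},$$

is given by

$$V(\rho) = 4\pi\left\{\frac{1}{\rho}\int_0^{\rho}\phi^2(\rho_1)\,\rho_1^2\,d\rho_1 + \int_{\rho}^{\infty}\phi^2(\rho_1)\,\rho_1\,d\rho_1\right\} = 4\left\{\frac{1}{\rho}\int_0^{\rho}e^{-2\rho_1}\rho_1^2\,d\rho_1 + \int_{\rho}^{\infty}e^{-2\rho_1}\rho_1\,d\rho_1\right\}.$$

The evaluation of these and similar definite integrals is discussed in Appendix III. Substituting these values in the last equation, the result obtained is

$$V(\rho) = \frac{1}{\rho}\left[1 - e^{-2\rho}(1+\rho)\right]. \tag{18}$$

3 The reader will find an especially interesting discussion of this topic in the paper by A. Unsöld, Ann. Physik, 82, 355 (1926-27). See also Appendix IV, Section 17.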

Let us now consider the interaction energy of this charge distribution with another charge distribution defined by $\phi^2(\rho_2) = (1/\pi)\,e^{-2\rho_2}$. This may be derived from a summation of terms, each of which gives the interaction energy between the charge distribution $\phi^2(\rho_1)$ and the charge located in the shell of thickness $d\rho_2$ at $\rho = \rho_2$. It is seen that these terms are of the form

$$\frac{1}{\rho_2}\left[1 - e^{-2\rho_2}(1+\rho_2)\right]\phi^2(\rho_2)\,4\pi\rho_2^2\,d\rho_2,$$

where $\rho_2$ takes the place of $\rho$ in equation (18).

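Hence, the total interaction energy due to the two charge distributions is

$$I = \int_0^{\infty}\phi^2(\rho_2)\,4\pi\rho_2^2\left\{\frac{1}{\rho_2}\int_0^{\rho_2}\phi^2(\rho_1)\,4\pi\rho_1^2\,d\rho_1 + \int_{\rho_2}^{\infty}\phi^2(\rho_1)\,4\pi\rho_1\,d\rho_1\right\}d\rho_2 = 4\int_0^{\infty}e^{-2\rho_2}\rho_2\left[1 - e^{-2\rho_2}(1+\rho_2)\right]d\rho_2. \tag{19}$$

As shown in Appendix III, the evaluation of these integrals leads to the result

$$I = 4\left(\frac{1}{4} - \frac{1}{16} - \frac{1}{32}\right) = \frac{5}{8}.$$

Consequently, we deduce from equation (17) the result

$$\Delta\lambda^{(1)} = \tfrac{5}{8}Z,$$

and hence the perturbation energy in ordinary units is given by

$$\Delta E = \tfrac{5}{4}RchZ. \tag{20}$$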
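The value 5/8 can be checked exactly with rational arithmetic. The sketch below assumes the integrand $4\,e^{-2\rho}\rho\,[1 - e^{-2\rho}(1+\rho)]$ for the interaction integral and uses the standard result $\int_0^\infty \rho^k e^{-a\rho}\,d\rho = k!/a^{k+1}$; the helper names `moment` and `coulomb_1s1s` are illustrative only.

```python
from fractions import Fraction
from math import factorial


def moment(k, a):
    """Exact integral of rho**k * exp(-a*rho) from 0 to infinity: k! / a**(k+1)."""
    return Fraction(factorial(k), a ** (k + 1))


def coulomb_1s1s():
    """4 * integral of e^{-2r} r [1 - e^{-2r}(1 + r)] dr, expanded term by term."""
    # r e^{-2r}  -  r e^{-4r}  -  r^2 e^{-4r}
    return 4 * (moment(1, 2) - moment(1, 4) - moment(2, 4))


if __name__ == "__main__":
    print(coulomb_1s1s())  # the first-order energy is this value times Z (Hartree units)
```

Multiplying by $Z$ gives the first-order energy in Hartree units, and multiplying again by $2Rch$ gives the value in ordinary units.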

4 This equation is the same, except for the difference in units of length, as equation (9) of Unsöld's paper.
Hence, the total energy, corrected for the first-order perturbation, is given [see equation (15)] by

$$E^0 + \Delta E = -2RchZ^2 + \tfrac{5}{4}RchZ.$$

This gives the energy of the helium atom with respect to the doubly ionized atom, as represented by the process

$$\text{He} \to \text{He}^{++} + 2e.$$

On the other hand, the energy absorbed in the process

$$\text{He}^{+} \to \text{He}^{++} + e$$

is $RchZ^2$.

Hence, the energy required for the process

$$\text{He} \to \text{He}^{+} + e,$$

that is, the ionization energy of the helium atom, 5 is

$$V_i = Rch\left(Z^2 - \tfrac{5}{4}Z\right) = 13.53 \times 1.5 = 20.30 \text{ volts}.$$

Since the observed value is 24.47 volts, there is a discrepancy of 4.17 volts, that is, of about 17 per cent of the correct value. But it should be noted that without the correction term $\tfrac{5}{4}Z$, the calculated value for $V_i$ would have been 13.53 × 4 = 54.1 volts, so that the correction introduced by use of the first-order perturbation has reduced the calculated value from one which is more than twice the observed value to one which is 17 per cent lower than that observed. By calculating the second- and higher-order terms of the perturbation energy, a much better approximation to the correct value could be obtained. However, by the application of the method of variation of parameters (discussed in the following chapter), it has been found possible to avoid the tedious calculations involved in the use of the perturbation theory method, with more satisfactory results.
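The arithmetic of this comparison can be reproduced in a few lines. The sketch below assumes the relation $V_i = Rch(Z^2 - \tfrac{5}{4}Z)$ with $Rch = 13.53$ volts and the observed value 24.47 volts as quoted in the text; the function names are illustrative.

```python
RCH_VOLTS = 13.53      # Rch expressed in volts, the value used in the text
OBSERVED_HE = 24.47    # observed ionization potential of helium quoted in the text


def zero_order_ionization(z, rch=RCH_VOLTS):
    """Ionization potential when the electron repulsion is ignored entirely."""
    return rch * z * z


def first_order_ionization(z, rch=RCH_VOLTS):
    """Ionization potential including the first-order repulsion term (5/4) Z."""
    return rch * (z * z - 1.25 * z)


if __name__ == "__main__":
    v0 = zero_order_ionization(2)
    v1 = first_order_ionization(2)
    deficit = OBSERVED_HE - v1
    print(f"zero order : {v0:.2f} volts")
    print(f"first order: {v1:.2f} volts")
    print(f"deficit    : {deficit:.2f} volts ({100 * deficit / OBSERVED_HE:.0f} per cent)")
```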

10.4 Excited States of the Helium Atom. Let us now consider the case in which one of the electrons is in the state n = 1, l = 0 (state 1), and the other is in the excited state n = n, l = 0 (state n). As pointed out previously we find that in this case it is possible to have two eigenfunctions corresponding to the same zero-order energy value. These may be described thus:

          Electron 1    Electron 2    Eigenfunction

State:        1             n         $F_{1n} = \phi_1(\rho_1)\,\phi_n(\rho_2)$

State:        n             1         $F_{n1} = \phi_n(\rho_1)\,\phi_1(\rho_2)$

5 Actually the value of R for helium should be used in this calculation. The
difference is, however, negligible for the present purpose. 
The system is therefore twofold degenerate, and we must apply the
perturbation theory as developed in Chapter IX for this case. Instead, 
however, of utilizing directly the results deduced there, it will un- 
doubtedly prove of help to the reader for a proper understanding of the 
more general discussion given in that chapter, if we apply these argu- 
ments to the more specific problem under consideration. 

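Since the functions $F_{1n}$ and $F_{n1}$ are equally valid solutions of the S. equation for the system, any linear combination of these will also be a solution. We therefore put

$$F = aF_{1n} + bF_{n1}, \tag{21}$$

and write for the solution of the perturbed system

$$\phi = F + f,$$

and for the energy of the perturbed system, in Hartree units,

$$\lambda = \lambda_1 + \lambda_n + \eta. \tag{22}$$

Substituting these in the S. equation (10b), we obtain the equation

$$(\nabla_1^2 + \nabla_2^2)(F + f) + \left(2\lambda_1 + 2\lambda_n + 2\eta + \frac{2Z}{\sigma_1} + \frac{2Z}{\sigma_2} - \frac{2}{\sigma_{12}}\right)(F + f) = 0.$$

Since $F_{1n}$ and $F_{n1}$ satisfy the S. equation for the unperturbed system,

$$(\nabla_1^2 + \nabla_2^2)F + \left(2\lambda_1 + 2\lambda_n + \frac{2Z}{\sigma_1} + \frac{2Z}{\sigma_2}\right)F = 0.$$

Neglecting products of the second order, it follows that

$$(\nabla_1^2 + \nabla_2^2)f + \left(2\lambda_1 + 2\lambda_n + \frac{2Z}{\sigma_1} + \frac{2Z}{\sigma_2}\right)f = \left(\frac{2}{\sigma_{12}} - 2\eta\right)F. \tag{23}$$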

This equation is inhomogeneous, and in order that it shall have a solution, it is necessary, as shown in the first-order perturbation theory for the non-degenerate case, that the right-hand side of (23) shall be orthogonal to each of the solutions $F_{1n}$ and $F_{n1}$ of the corresponding homogeneous equation. Hence, we obtain the two relations

$$\int (s - \eta)\,F\cdot F_{1n}\,d\tau = 0, \tag{24}$$

and

$$\int (s - \eta)\,F\cdot F_{n1}\,d\tau = 0, \tag{25}$$

where $s = 1/\sigma_{12}$, and the integration is carried out over the configuration space defined by the element of "volume,"

$$d\tau = d\tau_1\,d\tau_2.$$

Substituting from (21) it follows that

$$a\int (s - \eta)F_{1n}^2\,d\tau + b\int (s - \eta)F_{1n}F_{n1}\,d\tau = 0, \tag{26}$$

and

$$a\int (s - \eta)F_{1n}F_{n1}\,d\tau + b\int (s - \eta)F_{n1}^2\,d\tau = 0. \tag{27}$$

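Let

$$V_{11} = \int sF_{1n}^2\,d\tau, \qquad V_{22} = \int sF_{n1}^2\,d\tau, \qquad V_{12} = V_{21} = \int sF_{n1}F_{1n}\,d\tau. \tag{28}$$

Now it is evident that

$$\int F_{n1}F_{1n}\,d\tau = \int \phi_1(\sigma_1)\phi_n(\sigma_1)\,4\pi\sigma_1^2\,d\sigma_1\int \phi_1(\sigma_2)\phi_n(\sigma_2)\,4\pi\sigma_2^2\,d\sigma_2 = 0, \tag{29}$$

since $\phi_1$ and $\phi_n$ are orthogonal. Also that

$$\int F_{1n}^2\,d\tau = \int F_{n1}^2\,d\tau = 1.$$

Furthermore, since the perturbation function $1/\sigma_{12}$ is symmetrical with respect to $\sigma_1$ and $\sigma_2$, it follows that

$$V_{11} = V_{22}.$$

Hence, we can write equations (26) and (27) in the form

$$a(V_{11} - \eta) + bV_{12} = 0,$$

and

$$aV_{12} + b(V_{11} - \eta) = 0.$$

The solution of this pair of equations is given by solving the "secular" equation

$$\begin{vmatrix} V_{11} - \eta & V_{12} \\ V_{12} & V_{11} - \eta \end{vmatrix} = 0, \tag{30}$$

that is,

$$(V_{11} - \eta)^2 - V_{12}^2 = 0.$$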
It follows that

$$\eta = V_{11} \pm V_{12},$$

and

$$a = \pm b.$$

Therefore, the two zero-order eigenfunctions are

$$F_S = a(F_{1n} + F_{n1})$$

and

$$F_A = a(F_{1n} - F_{n1}),$$

where $F_S$ is evidently a symmetrical function, since it does not change sign when the coordinates of the two electrons are interchanged; while $F_A$ is antisymmetrical.

Since $F_S$ and $F_A$ must each be normalized, it follows that

$$a^2\int (F_{1n}^2 + F_{n1}^2)\,d\tau \pm 2a^2\int F_{1n}F_{n1}\,d\tau = 1,$$

where the last integral on the left-hand side is equal to zero.

Therefore, $2a^2 = 1$; $a = 1/\sqrt{2}$, and the two zero-order eigenfunctions are

$$F_S = \frac{1}{\sqrt{2}}\left\{\phi_1(\sigma_1)\phi_n(\sigma_2) + \phi_n(\sigma_1)\phi_1(\sigma_2)\right\}, \tag{31}$$

and

$$F_A = \frac{1}{\sqrt{2}}\left\{\phi_1(\sigma_1)\phi_n(\sigma_2) - \phi_n(\sigma_1)\phi_1(\sigma_2)\right\}. \tag{32}$$

The corresponding eigenvalues to a first-order approximation are given, in Hartree units, by the relations

$$\lambda_S = -\frac{Z^2}{2}\left(1 + \frac{1}{n^2}\right) + V_{11} + V_{12}; \tag{33}$$

$$\lambda_A = -\frac{Z^2}{2}\left(1 + \frac{1}{n^2}\right) + V_{11} - V_{12}; \tag{34}$$

as is evident from equation (12).
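The two roots of the secular equation and the accompanying coefficients can be illustrated with a short sketch. It assumes the symmetric two-by-two problem with equal diagonal elements described above, uses the symbol `eta` for the first-order energy, and takes the n = 2 values 0.210 and 0.022 (per unit Z, Hartree units) only as sample inputs.

```python
import math


def solve_secular(v11, v12):
    """Roots of (V11 - eta)^2 - V12^2 = 0 with normalized coefficients (a, b)."""
    norm = 1.0 / math.sqrt(2.0)
    return [
        (v11 + v12, (norm, norm)),    # symmetric combination, a = +b
        (v11 - v12, (norm, -norm)),   # antisymmetric combination, a = -b
    ]


def residual(v11, v12, eta, a, b):
    """Left-hand sides of a(V11 - eta) + b V12 = 0 and a V12 + b(V11 - eta) = 0."""
    return (a * (v11 - eta) + b * v12, a * v12 + b * (v11 - eta))


if __name__ == "__main__":
    for eta, (a, b) in solve_secular(0.210, 0.022):
        r1, r2 = residual(0.210, 0.022, eta, a, b)
        # 2*eta converts from Hartree units (2Rch) to units of Rch
        print(f"eta = {eta:.3f}  (2*eta = {2 * eta:.3f}),  a = {a:+.4f}, b = {b:+.4f}, "
              f"residuals = {r1:.1e}, {r2:.1e}")
```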

The last two relations signify that, as a result of the interaction of the electrons, that state, for which the energy is $-(1 + 1/n^2)RchZ^2$ in absence of such interaction, is split up into two states which differ in energy values.
For the case n = 2, as shown in the next section,

$$V_{11} = 0.210Z; \qquad V_{12} = 0.022Z;$$

so that,

$$E_S = \lambda_S\cdot 2Rch = -\left(1 + \tfrac{1}{4}\right)RchZ^2 + 0.464\,RchZ;$$

and

$$E_A = \lambda_A\cdot 2Rch = -\left(1 + \tfrac{1}{4}\right)RchZ^2 + 0.376\,RchZ.$$

Thus $E_A$ and $E_S$ represent spectral energy levels which are symmetrically located with respect to the energy value $E = -(1 + \tfrac{1}{4})RchZ^2 + 0.420\,RchZ$, and the level corresponding to the antisymmetric state is lower, since it is more negative. That is, the binding of the two electrons to the nucleus is greater for the antisymmetric state. If no repulsive forces existed between the electrons, the binding energy to the nucleus would be $(\tfrac{5}{4})RchZ^2$. The existence of repulsive forces decreases this energy by $0.420\,RchZ$, corresponding to the term $V_{11}$. While the existence of this matrix element is therefore readily interpreted on the basis of classical concepts of interaction of electrical charge distributions, the existence of the additional term $V_{12}$, in the expression for the energy, results, as is evident from the previous considerations, from the solution of the S. equation, and is therefore a non-classical deduction. That this result is a purely quantum mechanical conclusion is of extreme significance, as will be pointed out in another section. Since the occurrence of the term $\pm V_{12}$ is due to the fact that the electrons may interchange "orbits," integrals of this type, which occur frequently in quantum mechanics calculations, have been designated as exchange integrals in contradistinction to the Coulomb integrals of which the term $V_{11}$ is a typical example.

10.5 Calculation of Matrix Elements $V_{11}$ and $V_{12}$. To determine the values of the matrix elements $V_{11}$ and $V_{12}$, it is necessary to evaluate the integrals in equations (28) and (29) for specific values of n. As an illustration of such a calculation, we shall consider the case n = 2, l = 0, m = 0, that is, the state of the electron designated spectroscopically as 2s. The hydrogen-like eigenfunction for this state is given, in accordance with equation (7.36), by the expression

$$\phi_{200} = \phi_2 = \frac{1}{\sqrt{N}}\,(\rho - 2)\,e^{-\rho/2}, \tag{35}$$

where $\rho = Z\sigma$, and

$$N = \int_0^{\infty}(\rho - 2)^2\,e^{-\rho}\,4\pi\rho^2\,d\rho = 32\pi.$$

Thus the problem of evaluating the integral $V_{11}$ is similar to that of the integral in equation (17), and therefore

$$V_{11} = Z\int_0^{\infty}\phi_2^2(\rho_2)\,4\pi\rho_2^2\left\{\frac{1}{\rho_2}\int_0^{\rho_2}\phi_1^2(\rho_1)\,4\pi\rho_1^2\,d\rho_1 + \int_{\rho_2}^{\infty}\phi_1^2(\rho_1)\,4\pi\rho_1\,d\rho_1\right\}d\rho_2$$

$$= \frac{Z}{8}\int_0^{\infty}\left\{e^{-\rho}(\rho^3 - 4\rho^2 + 4\rho) - e^{-3\rho}(\rho^4 - 3\rho^3 + 4\rho)\right\}d\rho. \tag{36}$$
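The Coulomb integral for the 1s, 2s configuration can be evaluated exactly in the same way as the 1s, 1s case. The sketch below assumes the integrand $\tfrac{1}{8}\{e^{-\rho}(\rho^3 - 4\rho^2 + 4\rho) - e^{-3\rho}(\rho^4 - 3\rho^3 + 4\rho)\}$ per unit Z and the elementary result $\int_0^\infty \rho^k e^{-a\rho}\,d\rho = k!/a^{k+1}$; the helper names are illustrative.

```python
from fractions import Fraction
from math import factorial


def moment(k, a):
    """Exact integral of rho**k * exp(-a*rho) from 0 to infinity: k! / a**(k+1)."""
    return Fraction(factorial(k), a ** (k + 1))


def coulomb_1s2s():
    """Coulomb integral V11 / Z for n = 2, in Hartree units."""
    first = moment(3, 1) - 4 * moment(2, 1) + 4 * moment(1, 1)    # e^{-rho} terms
    second = moment(4, 3) - 3 * moment(3, 3) + 4 * moment(1, 3)   # e^{-3 rho} terms
    return Fraction(1, 8) * (first - second)


if __name__ == "__main__":
    v11 = coulomb_1s2s()
    print(v11, float(v11))  # compare with the value 0.210 Z quoted in Section 10.4
```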

By analogy with the expression given in the last equation for calculating the integral $V_{11}$, the integral $V_{12}$ is expressed in the equivalent form

$$V_{12} = Z\int_0^{\infty}\phi_2(\rho_2)\phi_1(\rho_2)\,4\pi\rho_2^2\left\{\frac{1}{\rho_2}\int_0^{\rho_2}\phi_2(\rho_1)\phi_1(\rho_1)\,4\pi\rho_1^2\,d\rho_1 + \int_{\rho_2}^{\infty}\phi_2(\rho_1)\phi_1(\rho_1)\,4\pi\rho_1\,d\rho_1\right\}d\rho_2. \tag{37}$$

That is, we replace $\phi_2^2(\rho_2)$ and $\phi_1^2(\rho_1)$ by "mixed" distributions $\phi_2(\rho_2)\phi_1(\rho_2)$ and $\phi_2(\rho_1)\phi_1(\rho_1)$, as if each electron were continuously alternating between the two quantum states, so that the effective charge distribution is a geometrical mean, as it were, of the two well-defined distributions. From this point of view the origin of the designation "exchange integral" is readily understood.
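Substituting in the integral in (37) from equations (9) and (35), the result is 6

$$V_{12} = Z\int_0^{\infty}\frac{(\rho_2 - 2)\,e^{-3\rho_2/2}}{4\pi\sqrt{2}}\,4\pi\rho_2^2\left\{\frac{1}{\rho_2}\int_0^{\rho_2}\frac{(\rho_1 - 2)\,e^{-3\rho_1/2}}{4\pi\sqrt{2}}\,4\pi\rho_1^2\,d\rho_1 + \int_{\rho_2}^{\infty}\frac{(\rho_1 - 2)\,e^{-3\rho_1/2}}{4\pi\sqrt{2}}\,4\pi\rho_1\,d\rho_1\right\}d\rho_2$$

$$= \frac{2^4}{3^6}\,Z = 0.0220Z,$$

that is, $0.0439\,RchZ$ in ordinary units.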
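The exchange integral can also be checked numerically. The sketch below assumes the "mixed" radial distribution $4\pi\phi_1\phi_2 = (\rho - 2)e^{-3\rho/2}/\sqrt{2}$ and evaluates the nested integral by the potential-theory route of Section 10.3, using a plain trapezoidal rule; the grid size and cut-off radius are arbitrary illustrative choices.

```python
import math


def exchange_integral_1s2s(n_steps=20000, rho_max=40.0):
    """Trapezoidal estimate of V12 / Z (Hartree units) for the 1s, 2s configuration."""
    h = rho_max / n_steps
    rho = [i * h for i in range(n_steps + 1)]

    def mixed(r):
        # 4*pi*phi_1*phi_2 with phi_1 = e^{-r}/sqrt(pi), phi_2 = (r-2)e^{-r/2}/sqrt(32 pi)
        return (r - 2.0) * math.exp(-1.5 * r) / math.sqrt(2.0)

    charge = [mixed(r) * r * r for r in rho]   # integrand for charge inside radius rho
    outer = [mixed(r) * r for r in rho]        # integrand for potential of outer shells

    inside = [0.0] * (n_steps + 1)             # integral from 0 to rho_i of charge
    for i in range(1, n_steps + 1):
        inside[i] = inside[i - 1] + 0.5 * h * (charge[i - 1] + charge[i])

    outside = [0.0] * (n_steps + 1)            # integral from rho_i to rho_max of outer
    for i in range(n_steps - 1, -1, -1):
        outside[i] = outside[i + 1] + 0.5 * h * (outer[i] + outer[i + 1])

    total = 0.0
    for i in range(1, n_steps + 1):            # the integrand vanishes at rho = 0
        potential = inside[i] / rho[i] + outside[i]
        weight = 0.5 if i == n_steps else 1.0
        total += weight * h * charge[i] * potential
    return total


if __name__ == "__main__":
    v12 = exchange_integral_1s2s()
    print(f"V12/Z = {v12:.5f} Hartree units = {2 * v12:.4f} Rch")
```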

For the general case in which one electron is in the state n = 1, l = 0, and the other in the state n > 1, l = 0, the integrals $V_{11}$ and $V_{12}$ may be expressed, as shown by Unsöld, by the relations



in terms of 2Rch as a unit of energy. For different values of n, the constants a and b have the values indicated in the following table:

n:      2          3          4

a:   0.0802     0.0232     0.00973

b:   0.0439     0.0115     0.00468

The fact that a decreases with increase in n is due to the decrease in Coulomb energy of repulsion between the electrons. The electron wave functions "overlap" less as n is increased. The decrease in the exchange energy $V_{12}$ with increase in n is accounted for on the same basis.

10.6 Exchange Energy. As mentioned already, the existence of terms such as $V_{12}$ is a purely quantum effect which has no classical mechanics analog. The nearest analog to the phenomenon is that of the interaction of linear harmonic oscillators, or that of the electron oscillating between two adjacent potential boxes. As pointed out in Chapter VIII, each of the single energy levels which exist for the unperturbed states of the system (in absence of any coupling action) is split up, as a result of the interaction, into two levels. The difference in energy of any pair of such levels decreases with increase in n. Also corresponding to each of these pairs of levels there exist a symmetric and an antisymmetric eigenfunction.

6 See footnote 3 of this chapter. 
Because of this formal resemblance of the interchange phenomenon in the helium atom to the resonance coupling of two linear harmonic oscillators, the term $V_{12}$ has also been designated as a resonance energy. However, it cannot be emphasized too strongly that any such analogy should not be used to give a physical interpretation of the results deduced in the previous sections. Rather, the occurrence of the term $V_{12}$ must be regarded as a logical consequence of the fact that in the excited state of the helium atom the system is twofold degenerate.

The difference between the symmetric and antisymmetric states of the excited helium atom may be represented diagrammatically by Fig. 52, which is taken from the excellent discussion of this topic by C. G. Darwin. 7

FIG. 52. Symmetry and antisymmetry of electrons. From "The New Conceptions of Matter," by C. G. Darwin.

The two electrons are assumed to be in "orbits" with angular momenta differing by one quantum ($n_1 = 1$, $n_2 = 2$). As Darwin points out, "If one electron is found to be at O, then in the symmetric mode the other is most likely to be at S, whereas in the antisymmetric mode it is most likely to be at A." From this point of view the fact that $E_A$ is greater numerically than $E_S$ is evidently due to the smaller magnitude of the repulsive energy term between the electrons when they are in the antisymmetric state, so that the net attractive energy between each electron and the nucleus is greater.

It is extremely important to realize that the term $V_{12}$ is not due to the action of any new type of force. It merely expresses the fact that the two electrons are indistinguishable in their motions. It is thus an indirect result of the validity of the Principle of Indeterminacy.

Instead of the designations $V_{kk}$ and $V_{kj}$ for the Coulomb and exchange integrals, respectively, a number of writers on quantum mechanics have adopted different symbols for the two types of perturbation integrals. Thus Pauling and Wilson use the letters J and K to indicate Coulomb and exchange integrals respectively. For instance,

$$J_s = \int\!\!\int \frac{F_{1n}^2}{\sigma_{12}}\,d\tau_1\,d\tau_2, \qquad K_s = \int\!\!\int \frac{F_{1n}F_{n1}}{\sigma_{12}}\,d\tau_1\,d\tau_2.$$

The subscript s refers to the fact that the eigenfunctions are of the 
spectral type s. 

7 " The New Conceptions of Matter," p. 205. 
As emphasized previously, the integral K arises from the fact that the two electrons are constantly interchanging "orbits." It will also be observed that the existence of this energy term is bound up with the occurrence of a twofold degeneracy, as indicated in equation (29), by the relation

$$K = \int\!\!\int \frac{1}{\sigma_{12}}\,F_{1n}F_{n1}\,d\tau_1\,d\tau_2.$$



Furthermore, in the case of the helium atom, this expression has 
a positive value. This result will appear of special significance in 
the discussion of the hydrogen molecule problem in a subsequent 
chapter. 

Since, in classical mechanics, we define a force as the negative derivative of the energy with respect to distance, in accordance with the relation

$$F = -\frac{dE}{dr},$$

we could logically consider the exchange energy as due to "exchange forces." However, the form of the expression for $V_{12}$ shows that such a calculation would not only be tedious, but also meaningless.

In fact, we find that in quantum mechanics, the concept "force" is not required. After all, the only magnitude susceptible of measurement is that of the energy involved in the formation of an atom from a nucleus and electrons, or the energy of dissociation of a molecule. Even in the case of van der Waals forces, which were discussed in Chapter VIII, the quantum mechanics calculation led to a value for the interaction energy of the form

$$E = -\frac{C}{r^6},$$

where C is a constant. From this it follows as a mathematical consequence that

$$F = -\frac{dE}{dr} = -\frac{6C}{r^7},$$

and if we wish to regard this as defining a force which is a function of r, it is permissible to do so. But actually this deduction does not in any manner help toward a better understanding of the phenomenon.
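As a small numerical illustration of the relation $F = -dE/dr$, the sketch below assumes an interaction energy of the inverse-sixth-power form $E = -C/r^6$ and compares the analytic derivative with a central finite difference. The constant `c`, the sample distance, and the step size are arbitrary illustrative values.

```python
def energy(r, c=1.0):
    """Assumed van der Waals interaction energy, E = -C / r^6."""
    return -c / r ** 6


def force_analytic(r, c=1.0):
    """F = -dE/dr = -6C / r^7 (negative sign: directed toward smaller r)."""
    return -6.0 * c / r ** 7


def force_numeric(r, c=1.0, h=1e-5):
    """Central-difference estimate of -dE/dr."""
    return -(energy(r + h, c) - energy(r - h, c)) / (2.0 * h)


if __name__ == "__main__":
    r = 2.0
    print(force_analytic(r), force_numeric(r))
```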

10.7 Orthohelium and Parhelium. As shown in the previous section, quantum mechanics leads to the conclusion that the spectrum of helium should exhibit two sets of energy levels. These are defined, in the case l = 0 (s-levels) by the relations

$$E_S = -\left(1 + \frac{1}{n^2}\right)RchZ^2 + \left(\frac{2}{n^2} - a + b\right)RchZ,$$

$$E_A = -\left(1 + \frac{1}{n^2}\right)RchZ^2 + \left(\frac{2}{n^2} - a - b\right)RchZ.$$

FIG. 53. Energy levels in spectrum of helium.

In terms of wave numbers [$\tilde{\nu} = -E/(hc)$] these are given by

$$\tilde{\nu} = R\left\{\left(1 + \frac{1}{n^2}\right)Z^2 - \frac{2Z}{n^2} + (a \mp b)Z\right\},$$

where (a − b) refers to $\tilde{\nu}_S$ and (a + b) to $\tilde{\nu}_A$, that is, for any given value of n the term with the smaller numerical value of $\tilde{\nu}$ should correspond to the symmetrical state.
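The size of the splitting for the first few values of n can be tabulated from the constants a and b of Section 10.5. The sketch below assumes that the repulsion enters as $(2/n^2 - a \pm b)RchZ$; this form is inferred from the n = 2 coefficients 0.464 and 0.376 quoted in Section 10.4 and should be read as an assumption, as should the use of Rch = 13.53 e.v.

```python
RCH_EV = 13.53  # Rch in electron-volts, the value used in the text

# Constants a and b for n = 2, 3, 4, copied from the table in Section 10.5.
UNSOLD = {2: (0.0802, 0.0439), 3: (0.0232, 0.0115), 4: (0.00973, 0.00468)}


def repulsion_coefficients(n):
    """Coefficients of Rch*Z for the symmetric and antisymmetric levels (assumed form)."""
    a, b = UNSOLD[n]
    return 2.0 / n ** 2 - a + b, 2.0 / n ** 2 - a - b


def level_energies_ev(n, z=2, rch=RCH_EV):
    """First-order energies (E_S, E_A) in e.v. for the 1s, ns configuration."""
    c_s, c_a = repulsion_coefficients(n)
    zero_order = -(1.0 + 1.0 / n ** 2) * rch * z * z
    return zero_order + c_s * rch * z, zero_order + c_a * rch * z


if __name__ == "__main__":
    for n in sorted(UNSOLD):
        e_s, e_a = level_energies_ev(n)
        print(f"n = {n}: E_S = {e_s:8.3f} e.v., E_A = {e_a:8.3f} e.v., "
              f"splitting = {e_s - e_a:.3f} e.v.")
```

On these assumptions the antisymmetric level lies below the symmetric one for every n, and the separation, $2bRchZ$, shrinks as n increases.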
Actually, the spectrum of helium (see Fig. 53) shows two sets of energy levels which were at one time regarded as characteristic of two different atomic species. These are known as the parhelium and orthohelium series. Each level is designated by the quantum states of the two electrons. Thus, the lowest level is $1s^2$, which indicates that each electron is in the state n = 1, l = 0 (s-levels), while in the two next higher levels, one electron is in the state 1s and the other in the state 2s (n = 2, l = 0). It will be observed that, for any given values of n, the level in the parhelium series has the numerically smaller value of the wave number. Consequently, the parhelium terms must correspond to the symmetrical eigenfunctions and the orthohelium terms to the antisymmetrical eigenfunctions.

The energy diagram also shows that the terms in the orthohelium series are actually triplets, whereas the corresponding terms in the parhelium series are singlets. Thus, there are three levels in the orthohelium series corresponding to the electron configuration 1s2p (n = 1, l = 0 for one electron, and n = 2, l = 1 for the other electron). Two of these levels (column designated $^3P_{2,1}$) are so close that it is impossible to indicate them on the diagram as separate levels.

In order to interpret the existence of multiplet levels such as those 
which occur in the case of helium, it has been found necessary to intro- 
duce the concept of electron spin. Since this concept and the related 
generalization, known as Pauli's Exclusion Principle, are of funda- 
mental importance in quantum mechanics, we shall consider these 
topics in the following section. 

10.8 Electron Spin. Pauli's Exclusion Principle. As shown in Chapter VII, the solution of the S. equation for the hydrogen atom leads to a number of eigenfunctions which require for their designation the three quantum numbers, n, l (where l = n − 1, n − 2, . . . 0) and m (where m = ±l, ±(l − 1), . . . 0). Since all the eigenfunctions for any given value of n have the same eigenvalue

$$E_n = -\frac{RchZ^2}{n^2},$$

we designate this state as degenerate. It was also shown that the degree of degeneracy for a given value of n is $n^2$.

For any given values of l and m, the total angular momentum of the electron in its motion about the nucleus is given by

$$M = \sqrt{l(l+1)}\;\frac{h}{2\pi},$$

while the component of this momentum about the Z-axis is

$$M_z = m\,\frac{h}{2\pi}.$$

In the presence of perturbing fields, such as those produced by the presence of other electrons in the atomic system, or by magnetic or electrostatic fields, the degeneracy may be removed either partly or even completely. Thus in the case of all the atomic systems for which Z > 1, the degeneracy with respect to l is removed, so that the states corresponding to different values of l, for the same value of n, have energy values which are no longer identical. These states are designated spectroscopically by the symbols s (for l = 0), p (for l = 1), d (for l = 2), and so forth.

The observations on the effect of magnetic fields on spectral lines (Paschen-Back Effect) show that in a magnetic field the total number of states corresponding to a given value of l is not 2l + 1 as expected from the previous considerations but twice this number. Thus the maximum number of levels for a 1s, 2s, or ns electron is 2, in spite of the fact that m can have only the single value 0. Similarly, for an np electron the maximum number of levels is 2(2 × 1 + 1) = 6, and for an nd electron, 2(2 × 2 + 1) = 10.
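The counting rule 2(2l + 1) is easily tabulated. The following sketch lists the maximum number of levels for each type of electron and sums them over the allowed values of l for a given n; the function names are illustrative.

```python
SUBSHELL_LETTERS = "spdf"


def subshell_capacity(l):
    """Number of states for a given l when the electron spin is included: 2(2l + 1)."""
    return 2 * (2 * l + 1)


def shell_capacity(n):
    """Total number of states for a given n, summing over l = 0, 1, ..., n - 1."""
    return sum(subshell_capacity(l) for l in range(n))


if __name__ == "__main__":
    for l, letter in enumerate(SUBSHELL_LETTERS):
        print(f"{letter}: {subshell_capacity(l)}")
    for n in range(1, 5):
        print(f"n = {n}: {shell_capacity(n)}")
```

The totals for each n come out as $2n^2$, that is, twice the degree of degeneracy $n^2$ mentioned above.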

Formally, this may be accounted for, as first pointed out by Uhlenbeck and Goudsmit, by the assumption of a spinning electron. The magnitude of the total angular momentum of spin is $\sqrt{s(s+1)}\,h/(2\pi)$, where s designates the electron spin quantum number. The component of this momentum in any given direction is $m_s\,(h/2\pi)$, where $m_s$ is a fourth quantum number, the value of which is either $+\tfrac{1}{2}$ or $-\tfrac{1}{2}$.

Thus any electron has assigned to it not only the three orbital quantum numbers n, l, m, but also a fourth number known as the magnetic quantum number $m_s$ ($= \pm\tfrac{1}{2}$) which defines the component of angular momentum with respect to a given axis.

By this theory it is possible to account not only for the observations 
on the effect of magnetic fields on spectral lines, but also for the existence 
of multiplet levels as deduced from the spectra of atomic systems 
containing more than one electron. 

For quantum mechanics the significance of these considerations is of 
importance because of Pauli's Exclusion Principle, which may be 
enunciated as follows: 

In any one atom there cannot exist two electrons having the four quantum
numbers (n, l, m, and $m_s$), respectively, the same in both.
Thus, if for two electrons, $n_1 = n_2$, $l_1 = l_2$, $m_1 = m_2$, then $m_s$ cannot be identical for both, but must have the value $+\tfrac{1}{2}$ for one and $-\tfrac{1}{2}$ for the other electron. That is, two electrons which are identical with respect to each of the three "orbital" quantum numbers, must be antiparallel with respect to the directions of the vectors which correspond to the spin momenta.

Pauli's principle is essentially the real justification for the scheme of 
electron distribution first suggested by E. C. Stoner and independently 
by J. D. Main-Smith in 1924. Assuming that the maximum number of 
electrons of any one type (s, p, d, etc.) which an atomic system may 
possess is given by the value of 2(2l + 1), it is possible to deduce the
variation in electron distribution as electrons are added in succession 
to a nucleus of charge +Ze. On this basis an adequate interpretation 
has been obtained of the periodic arrangement of the elements and of 
the variation in the characteristics of their spectral terms. 

10.9 Multiplet Levels in Spectrum of Helium. From Pauli's principle it follows that in the lowest electronic state (normal state) of the
helium atom, in which both electrons are in Is states, the spins must be 
in opposite directions. Since the normal state belongs to the parhelium 
series, this argument is additional confirmation of the conclusion, stated 
in a previous section, that the parhelium states correspond to the 
symmetrical eigenfunctions. 

To account for the existence of multiplets it is necessary to intro- 
duce into the quantum mechanics formulation of the eigenfunctions 
the electron spin concept. The procedure used for this purpose is as 
follows: 8 

It is assumed that corresponding to the electron spin there exists an eigenfunction $\psi(s)$ which designates the orientation of the axis of spin in a magnetic field. For two electrons there will exist two such functions $\psi(s_1)$ and $\psi(s_2)$, and consequently the complete spin function for such a system may be represented by any one of the combinations shown in the attached table. The first two columns give the values of $m_s$, the third that of $\Sigma m_s$, and the last column the corresponding combination.

 m_s1     m_s2     Σm_s     Eigenfunction

 +1/2     +1/2      +1      ψ(+1/2)ψ(+1/2) = α

 +1/2     -1/2       0      ψ(+1/2)ψ(-1/2) = β

 -1/2     +1/2       0      ψ(-1/2)ψ(+1/2) = γ

 -1/2     -1/2      -1      ψ(-1/2)ψ(-1/2) = δ

8 The following remarks are based upon the discussion in A. Sommerfeld's "Wave 
Mechanics," pp. 231-233, 
The eigenfunctions β and γ evidently represent a degenerate state, since they correspond to interchange of electrons, and we must therefore replace them for the perturbed state of the system by $(\beta + \gamma)/\sqrt{2}$ and $(\beta - \gamma)/\sqrt{2}$. The factor $1/\sqrt{2}$ is introduced for the same reason as it was introduced in equations (31) and (32) for the orbital eigenfunctions $F_S$ and $F_A$. We thus obtain three spin eigenfunctions which are symmetrical in the electron spins, viz.:

$$\alpha, \qquad \frac{\beta + \gamma}{\sqrt{2}}, \qquad \text{and} \qquad \delta,$$

and one which is antisymmetrical, viz.: $(\beta - \gamma)/\sqrt{2}$. The first set give rise to triplet terms, for which the values of $\Sigma m_s$ are 1, 0, and −1, while the last function corresponds to the singlet term $\Sigma m_s = 0$.

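The complete eigenfunctions for the system of two electrons are obtained as the products of orbital and spin functions. According to equations (31) and (32), the orbital eigenfunctions are

$$F_S = \frac{1}{\sqrt{2}}\left\{\phi_1(\sigma_1)\phi_n(\sigma_2) + \phi_n(\sigma_1)\phi_1(\sigma_2)\right\} = \frac{1}{\sqrt{2}}\,(u + v)$$

and

$$F_A = \frac{1}{\sqrt{2}}\left\{\phi_1(\sigma_1)\phi_n(\sigma_2) - \phi_n(\sigma_1)\phi_1(\sigma_2)\right\} = \frac{1}{\sqrt{2}}\,(u - v).$$

Hence, we obtain the following eight combinations of orbital and spin functions which we shall arrange in two sets:

$$\text{(A)} \qquad \frac{(u+v)\,\alpha}{\sqrt{2}}, \qquad \frac{(u+v)(\beta+\gamma)}{2}, \qquad \frac{(u+v)\,\delta}{\sqrt{2}}, \qquad \frac{(u-v)(\beta-\gamma)}{2};$$

$$\text{(B)} \qquad \frac{(u-v)\,\alpha}{\sqrt{2}}, \qquad \frac{(u-v)(\beta+\gamma)}{2}, \qquad \frac{(u-v)\,\delta}{\sqrt{2}}, \qquad \frac{(u+v)(\beta-\gamma)}{2}.$$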

Group (A) evidently contains only symmetrical functions, while group (B) consists of the antisymmetrical functions. Now in the energy-level diagram for helium we observe that corresponding to a given value of n and a value of l > 0 for the excited state of one of the electrons, there are only four levels, three of which belong to the triplet (orthohelium) series and the other to the singlet (parhelium) series. As deduced previously, the latter corresponds to the function which is symmetrical in the orbital functions and antisymmetrical in the spin functions. This evidently means that the parhelium series corresponds to the eigenfunction $(u + v)(\beta - \gamma)/2$ and that the orthohelium series corresponds to the other three members of group (B).

This conclusion may be stated in the following very significant 
generalization: The complete solution of the wave equation for any atomic 
system must involve only that type of eigenfunction which is antisymmetrical, 
that is, changes sign when the electrons are interchanged. Evidently, 
this is the quantum mechanics interpretation of Pauli's Exclusion 
Principle, and we shall find it extremely important in the consideration 
of valence problems. 
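The sorting of the eight products into symmetrical and antisymmetrical functions can be verified mechanically. The sketch below represents each two-electron function as a table of coefficients indexed by (orbital of electron 1, orbital of electron 2, spin of electron 1, spin of electron 2), ignores the normalizing factors, and applies the interchange of the two electrons. The representation and all names are illustrative assumptions, not notation from the text.

```python
from itertools import product

UP, DOWN = "+", "-"

# Orbital functions: keys are (orbital of electron 1, orbital of electron 2).
ORBITALS = {
    "F_S": {("1", "n"): 1, ("n", "1"): 1},    # u + v
    "F_A": {("1", "n"): 1, ("n", "1"): -1},   # u - v
}

# Spin functions: keys are (spin of electron 1, spin of electron 2).
SPINS = {
    "alpha": {(UP, UP): 1},
    "beta+gamma": {(UP, DOWN): 1, (DOWN, UP): 1},
    "delta": {(DOWN, DOWN): 1},
    "beta-gamma": {(UP, DOWN): 1, (DOWN, UP): -1},
}


def combine(orbital, spin):
    """Product of an orbital function and a spin function."""
    out = {}
    for (o, c_o), (s, c_s) in product(orbital.items(), spin.items()):
        key = (o[0], o[1], s[0], s[1])
        out[key] = out.get(key, 0) + c_o * c_s
    return out


def exchange(func):
    """Interchange the two electrons (orbital and spin labels together)."""
    return {(o2, o1, s2, s1): c for (o1, o2, s1, s2), c in func.items()}


def symmetry(func):
    swapped = exchange(func)
    if swapped == func:
        return "symmetric"
    if swapped == {k: -c for k, c in func.items()}:
        return "antisymmetric"
    return "neither"


def classify_all():
    return {
        (o_name, s_name): symmetry(combine(orb, spin))
        for o_name, orb in ORBITALS.items()
        for s_name, spin in SPINS.items()
    }


if __name__ == "__main__":
    for (o_name, s_name), kind in classify_all().items():
        print(f"{o_name} x ({s_name}): {kind}")
```

Four of the eight products come out antisymmetrical: the symmetrical orbital function with the antisymmetrical spin function, and the antisymmetrical orbital function with each of the three symmetrical spin functions.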

One other deduction which should be mentioned in this connection is the following:

In absence of any interaction between the electron spins and the orbital motions, no transition can occur between a state corresponding to a symmetrical eigenfunction $F_S$ and another state corresponding to an antisymmetrical function $F_A$. As shown in the energy-level diagram, Fig. 53, there is only one line (λ591.6), and that a faint one, which corresponds to a transition between the ortho- and parhelium
series. On the other hand, with atoms of higher nuclear charge, where 
there is interaction between the electron spin and the orbital motion, 
lines corresponding to transitions between singlet and triplet levels 
frequently occur. 
CHAPTER XI
THE HELIUM ATOM: VARIATIONAL METHOD

11.1 Minimum Energy Principle. The Variation Theorem. The variational (or Ritz) method has been used extensively in classical physics, especially in the field of dynamics, where its application is at least as old as d'Alembert's principle of virtual displacements. In Chapter IV this principle was discussed as well as the subsequently developed Hamilton's Principle, which was stated in the following form:

For any dynamical path, the time integral of the function $L = L(q_i, \dot{q}_i, t)$, known as the Lagrangian or kinetic potential function, must be an extremum. The function L is defined by the relation

$$L = T - V,$$

where T, the kinetic energy, is a quadratic function of the generalized velocities $\dot{q}_i$, while V, the potential energy, is a function of the generalized coordinates $q_i$.

Hamilton's Principle can be expressed in the variational form

$$\delta\int_{t_0}^{t_1} L\,dt = \delta\int_{t_0}^{t_1} (T - V)\,dt = 0, \tag{4.79}$$

and, as may be shown by the methods of the calculus of variation, the
conditions which must be satisfied in order that the integral in the last
equation shall be an extremum are given by the f equations, one for
each generalized coordinate, of the form

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right) - \frac{\partial L}{\partial q_i} = 0. \tag{4.36}$$

The last equation is known as Euler's equation in the calculus of
variation, and also as Lagrange's equation in dynamics. It will be
observed that in equation (4.79) the variation applies to a function L
and not to a coordinate (as in ordinary calculus), and it is because of
this generality that the criterion (4.36) has proved so extremely useful
in solving many problems in classical dynamics.

1 In most cases it will be found that the integral which satisfies the differential
equation (4.36) is a minimum. However, the proof of this conclusion would
involve much more tedious mathematics.
It was therefore not unreasonable to expect that the method of
variations would find equally important application in connection with 
the problems of quantum mechanics. In fact, the most striking feature 
about the whole history of the development of theoretical physics during 
the past two hundred years has been the continuous extension of con- 
cepts. The experimental discoveries always seem to demand revolu- 
tionary changes in ideas and yet, when in the course of time these 
observations are arranged in a logical frame, it is found that many of 
the concepts held previously require only a slight modification, or ex- 
tension, in order to be able to reconcile them with the new facts. 

In the case of quantum mechanics it is readily shown that the S. 
equation is essentially the Euler differential equation which must be satis- 
fied in order that a certain definite integral (which corresponds to the 
total energy) shall be a minimum. 2 Hence, instead of attempting to 
solve the S. equation in a particularly difficult case, it may be much 
more feasible to introduce one or more arbitrary parameters into the 
expression for the corresponding variational integral. By investigating 
the effect of variations in the values of these parameters, on the value 
of the integral, it is then possible to determine a minimum value for 
the latter, that is, a minimum value for the eigenvalue which corre- 
sponds to the given S. equation for a particular energy state. 

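Let us consider first the S. equation in the operator form

$$H\phi = \left[-\frac{1}{\alpha^2}\nabla^2 + V\right]\phi = E\phi, \tag{1}$$

where $\alpha^2 = 8\pi^2\mu/h^2$, and H, known as the Hamiltonian, designates the
operator in the brackets. Multiplying both sides by $\bar{\phi}$ and integrating
over the configuration space, the result is

$$\int \bar{\phi}\,H\phi\,d\tau = E\int \bar{\phi}\,\phi\,d\tau,$$

where $d\tau$ is the element of volume in this space. Hence,

$$E = \frac{\int \bar{\phi}\,H\phi\,d\tau}{\int \bar{\phi}\,\phi\,d\tau}. \tag{2}$$

For normalized functions, the denominator in (2) is equal to unity,
and $E = \int \bar{\phi}\,H\phi\,d\tau$, where $\phi$ is an eigenfunction for the given S. equation.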

2 This was pointed out by E. Schroedinger in his first paper on "Quantization
as an Eigenvalue Problem," Ann. Physik, [4], 79, 361 (1926).
Suppose, however, that we do not know the exact form of $\phi$ which
should be used to solve equation (1). We may then make a more or
less shrewd guess as to the form of the function. Let $\psi$ designate this
new function, and let

$$M = \int \bar{\psi}\,H\psi\,d\tau, \qquad N = \int \bar{\psi}\,\psi\,d\tau,$$

and

$$I = \frac{M}{N}. \tag{3}$$

Then, as shown in the following section, the minimum value of I,
which we shall denote by $E'$, is always greater (more positive) than, or at
least equal to, the true eigenvalue E for the corresponding S. equation. That is,

$$E' \geq E, \tag{4}$$

or

$$\int \bar{\psi}\,H\psi\,d\tau - E\int \bar{\psi}\,\psi\,d\tau \geq 0. \tag{5}$$

For $\psi = \phi$, the correct eigenfunction, the expression on the left-hand
side of the last equation is equal to zero.
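The inequality (5) can be checked numerically on a problem whose exact eigenvalue is known. The following is only a sketch under stated assumptions, not part of the original treatment: a particle confined to the interval 0 to 1 with V = 0 and $\alpha^2 = 1$ (so that $H = -d^2/dx^2$ and the lowest eigenvalue is $\pi^2$), a hypothetical one-parameter trial function $\psi = x(1-x) + c\,x^2(1-x)^2$, and Simpson's rule for the integrals M and N.

```python
import math

def simpson(f, a, b, n=200):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def variational_ratio(c):
    """I = M/N for psi = x(1-x) + c*x^2*(1-x)^2 on the interval [0, 1].

    Assumed model: V = 0 and alpha^2 = 1, so H = -d^2/dx^2.  Because psi
    vanishes at both ends, M reduces to the integral of (d psi/dx)^2.
    """
    psi = lambda x: x * (1 - x) + c * (x * (1 - x)) ** 2
    dpsi = lambda x: (1 - 2 * x) * (1 + 2 * c * x * (1 - x))
    M = simpson(lambda x: dpsi(x) ** 2, 0.0, 1.0)
    N = simpson(lambda x: psi(x) ** 2, 0.0, 1.0)
    return M / N

E_TRUE = math.pi ** 2  # exact lowest eigenvalue of the assumed model problem

# Scan the parameter c from 0.0 to 3.0 and keep the smallest value of I.
values = {c / 10: variational_ratio(c / 10) for c in range(0, 31)}
best_c = min(values, key=values.get)
print(f"c = 0.0       : I = {values[0.0]:.5f}")
print(f"best c = {best_c:.1f}  : I = {values[best_c]:.5f}   (exact E = {E_TRUE:.5f})")
```

Every value of I in the scan lies above $\pi^2$, and the added parameter c brings the minimum much closer to the true eigenvalue than the single function $x(1-x)$ does.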

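To demonstrate the validity of (4) we proceed as follows: 3

If I = M/N = minimum, then

$$\delta I = \frac{N\,\delta M - M\,\delta N}{N^2} = 0.$$

Hence, writing $E'$ for the value of M/N at the minimum,

$$\delta M - E'\,\delta N = 0.$$

That is,

$$\int (\delta\bar{\psi})\,H\psi\,d\tau + \int \bar{\psi}\,H(\delta\psi)\,d\tau - E'\int (\delta\bar{\psi})\,\psi\,d\tau - E'\int \bar{\psi}\,\delta\psi\,d\tau = 0,$$

or

$$\int \delta\bar{\psi}\,(H - E')\psi\,d\tau + \int \bar{\psi}\,(H - E')\,\delta\psi\,d\tau = 0. \tag{6}$$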



Since $\delta\psi$ is arbitrary, let us replace it by $i\,\delta\psi$. 4 Hence, since $\bar{\psi}\psi = |\psi|^2$,
a real function, we must replace $\delta\bar{\psi}$ by $-i\,\delta\bar{\psi}$, in order that the product
$(-i\,\delta\bar{\psi})(i\,\delta\psi)$ shall be real. Consequently, (6) becomes

$$-i\int \delta\bar{\psi}\,(H - E')\psi\,d\tau + i\int \bar{\psi}\,(H - E')\,\delta\psi\,d\tau = 0.$$

Dividing both terms by i, and comparing the resulting equation with
(6), it follows that the latter can be valid only if each integral vanishes
identically. 5 Hence,

$$(H - E')\psi = 0, \tag{7}$$

which shows that $E'$ is an eigenvalue corresponding to the function $\psi$.

3 A proof of this theorem was first given by C. Eckart, Phys. Rev., 36, 878 (1930).
For the proof given in the text the writer is indebted to Dr. F. Seitz.

4 This part of the proof is taken from notes on lectures delivered by P. A. M.
Dirac at Princeton University in 1931.

These considerations show that the S. equation is the Euler condition 
which must be satisfied in order that the integral I in (3) shall be a 
minimum. The latter may also be expressed in another form, in which 
it has been used quite frequently by writers on quantum mechanics. 

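Referring to equation (1), which is merely the S. equation, let us consider
the case in which $\phi$ as well as V are functions of the Cartesian coordinates
x, y, z, and $\phi$ is a real function. (This limitation to real
functions and rectangular coordinates simplifies the mathematics but
does not detract from the general validity of the conclusions.) Then,

$$E = \int \phi H\phi\,dv = -\frac{1}{\alpha^2}\int \phi\left(\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial y^2} + \frac{\partial^2\phi}{\partial z^2}\right)dv + \int V\phi^2\,dv.$$

But

$$\phi\,\frac{\partial^2\phi}{\partial x^2} = \frac{\partial}{\partial x}\left(\phi\,\frac{\partial\phi}{\partial x}\right) - \left(\frac{\partial\phi}{\partial x}\right)^2.$$

Hence, designating dxdydz by dv,

$$E = -\frac{1}{\alpha^2}\left[\phi\frac{\partial\phi}{\partial x} + \phi\frac{\partial\phi}{\partial y} + \phi\frac{\partial\phi}{\partial z}\right] + \frac{1}{\alpha^2}\int\left\{\left(\frac{\partial\phi}{\partial x}\right)^2 + \left(\frac{\partial\phi}{\partial y}\right)^2 + \left(\frac{\partial\phi}{\partial z}\right)^2\right\}dv + \int V\phi^2\,dv,$$

where the limits of integration are $x = y = z = \pm\infty$. Since $\phi$ vanishes
at the limits of integration the expression in the brackets must
be equal to zero. Hence, the integral to be minimized is

$$I = \int\left\{\frac{1}{\alpha^2}\left[\left(\frac{\partial\phi}{\partial x}\right)^2 + \left(\frac{\partial\phi}{\partial y}\right)^2 + \left(\frac{\partial\phi}{\partial z}\right)^2\right] + V\phi^2\right\}dv, \tag{8}$$

subject to the condition $\int \phi^2\,dv = 1$.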



5 An operator O such that $\int \bar{\psi}\,O\phi\,d\tau = \int \phi\,\overline{O\psi}\,d\tau$ is known as Hermitian or self-adjoint.
Evidently H or $(H - E)$ is an operator of this type.
That is,

$$\int\left\{\frac{1}{\alpha^2}\sum_q\left(\frac{\partial\phi}{\partial q}\right)^2 + (V - E)\phi^2\right\}dv$$

(where q denotes x, y, or z) must be a minimum.

Thus the S. equation is the differential equation which must be 
satisfied in order that the integral in equation (8) shall be a minimum. 

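The form of this integral indicates a method by which equation (8)
may be derived from the expression for E in the Hamiltonian form

$$E = \frac{1}{2\mu}\left(p_x^2 + p_y^2 + p_z^2\right) + V.$$

In this relation let us replace $p_x$ by $(h/2\pi)\,\partial\phi/\partial x$, and similarly for
$p_y$ and $p_z$. Then we obtain the relation

$$E = \frac{h^2}{8\pi^2\mu}\left[\left(\frac{\partial\phi}{\partial x}\right)^2 + \left(\frac{\partial\phi}{\partial y}\right)^2 + \left(\frac{\partial\phi}{\partial z}\right)^2\right] + V,$$

in which the expression in the brackets, multiplied by $h^2/8\pi^2\mu = 1/\alpha^2$, is
identical with the first term in the integrand of equation (8).

This may be extended to any system for which the total energy is
expressed in the Hamiltonian form, that is, as a function of the f
generalized coordinates $q_1 \ldots q_f$, and canonically conjugate
momenta $p_1 \ldots p_f$.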

Let 



Under these conditions the element of volume is given by

$$d\tau = \sqrt{\Delta}\;dq_1 \ldots dq_f,$$

where $\sqrt{\Delta}$ is known as the "discriminant," 6 and therefore the integral
to be minimized is

with the condition that

$$\int \phi^2\sqrt{\Delta}\;dq_1 \ldots dq_f = 1,$$

where the integration is carried out over the whole configuration space. 

6 This was discussed in Chapter VI in connection with the form of the Lapla- 
cian operator. The discriminant is thus identical in the case of three coordinates, 
with the magnitude Vaicfeas used in equation i 
Now in any treatise on the calculus of variation it is shown that, given
a definite integral of the form

$$I = \int F(q_i, \phi, \phi_j)\;dq_1 \ldots dq_f,$$

the condition to be satisfied in order that this integral shall be a minimum
is given by the Euler equation

$$\frac{\partial F}{\partial\phi} - \sum_j \frac{\partial}{\partial q_j}\left(\frac{\partial F}{\partial\phi_j}\right) = 0,$$

where $\phi_j = \partial\phi/\partial q_j$, and i and j are any pair of the numbers 1, 2 . . . f.

If we apply this condition to the integral in (9) we obtain the S. 
equation 



where E is the minimum value of I in (9) for a given form of $\phi$, and
$A_{ik}$ is a coefficient of the term involving $\partial^2\phi/(\partial q_i\,\partial q_k)$.

Thus equation (10) leads to the expression for the Laplacian operator
in the S. equation 7

$$\nabla^2\phi + \alpha^2(E - V)\phi = 0.$$



As an illustration of the application of these equations, let us consider 
the problem of the linear harmonic oscillator, which was discussed in 
Chapter V. 

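In terms of q, the displacement, and p, the corresponding momentum,

$$E = \frac{p^2}{2\mu} + 2\pi^2\mu\nu_0^2q^2.$$

Hence, the variational function is given by

$$I = \int_{-\infty}^{\infty}\left\{\frac{h^2}{8\pi^2\mu}\left(\frac{d\phi}{dq}\right)^2 + 2\pi^2\mu\nu_0^2q^2\phi^2\right\}dq,$$

with the condition $\int_{-\infty}^{\infty}\phi^2\,dq = 1$.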



7 See also E. U. Condon and P. M. Morse, "Quantum Mechanics," Chapter I,
and E. C. Kemble, Phys. Rev. Suppl., 1, 157 (1929), for more detailed discussion
of this topic.
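We can combine these two relations in the form, analogous to equation (11),

$$\delta\int_{-\infty}^{\infty}\left\{\frac{h^2}{8\pi^2\mu}\left(\frac{d\phi}{dq}\right)^2 + \left(2\pi^2\mu\nu_0^2q^2 - E\right)\phi^2\right\}dq = 0.$$

Denoting the expression in the integrand by $F(q, \phi, d\phi/dq)$, it is evident,
if we denote $d\phi/dq$ by $\phi_q$, that

$$\frac{\partial F}{\partial\phi_q} = \frac{2h^2}{8\pi^2\mu}\,\frac{d\phi}{dq}.$$

Hence,

$$\frac{d}{dq}\left(\frac{\partial F}{\partial\phi_q}\right) = \frac{h^2}{4\pi^2\mu}\,\frac{d^2\phi}{dq^2},$$

and

$$\frac{\partial F}{\partial\phi} - \frac{d}{dq}\left(\frac{\partial F}{\partial\phi_q}\right) = 2\left(2\pi^2\mu\nu_0^2q^2 - E\right)\phi - \frac{h^2}{4\pi^2\mu}\,\frac{d^2\phi}{dq^2} = 0,$$

if (I - E) is to be a minimum.

The right-hand expression in the last equation, set equal to zero, is obviously
the same as the S. equation (5.5) for the linear harmonic oscillator.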
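The oscillator integral can also be minimized directly, without passing through the Euler equation. The sketch below is an illustration under assumptions that are not in the text: units are chosen so that $h/2\pi = \mu = 2\pi\nu_0 = 1$ (the integrand then becomes $\frac{1}{2}(d\phi/dq)^2 + \frac{1}{2}q^2\phi^2$ and the exact lowest eigenvalue is 1/2), the trial function is the hypothetical family $\phi = e^{-aq^2}$, and the integrals are evaluated by Simpson's rule.

```python
import math

def simpson(f, a, b, n=1200):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def oscillator_integral(a, q_max=12.0):
    """Variational integral for phi = exp(-a q^2), divided by the norm.

    Assumed units: h/2pi = mu = 2*pi*nu0 = 1, so the integrand is
    (1/2)(d phi/dq)^2 + (1/2) q^2 phi^2.
    """
    phi = lambda q: math.exp(-a * q * q)
    dphi = lambda q: -2.0 * a * q * math.exp(-a * q * q)
    num = simpson(lambda q: 0.5 * dphi(q) ** 2 + 0.5 * q * q * phi(q) ** 2,
                  -q_max, q_max)
    den = simpson(lambda q: phi(q) ** 2, -q_max, q_max)
    return num / den

# Scan the parameter a from 0.10 to 2.00 in steps of 0.01.
scan = {i / 100: oscillator_integral(i / 100) for i in range(10, 201)}
best_a = min(scan, key=scan.get)
print(f"minimum at a = {best_a:.2f}, I = {scan[best_a]:.6f}")
```

Within this family the minimum falls at a = 1/2, where the trial function coincides with the exact eigenfunction and I equals the exact eigenvalue; for every other a the integral is larger.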

11.2 The Energy for the Normal State of the Helium Atom. In the 
previous chapter the energy of the normal state was calculated by use of 
the relation for the first-order perturbation energy. The value derived 
in this manner for the ionization potential of He was found to be 4.17 
volts too low. It will be shown in this section that by the use of the 
variational method it is possible to deduce a value which is much nearer 
to the spectroscopically correct value. 

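As shown in equation (10.10b) the S. equation for the system, in
which the eigenvalue is expressed in terms of Hartree units, has the form

$$\nabla_1^2\phi + \nabla_2^2\phi + 2\left(\lambda + \frac{Z}{\sigma_1} + \frac{Z}{\sigma_2} - \frac{1}{\sigma_{12}}\right)\phi = 0. \tag{12}$$

Let us assume a normalized solution of the form

$$\phi = \frac{\beta^3Z^3}{\pi}\,e^{-\beta Z(\sigma_1 + \sigma_2)}, \tag{13}$$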
where $\beta$ is a parameter which has to be determined from the condition

$$\frac{\partial\lambda}{\partial\beta} = 0.$$

Equation (13) means, physically, that we choose a solution in which
the nuclear charge Ze is replaced by the effective charge $\beta Ze$.

For the state n = 1 of a hydrogen-like atom of charge $\beta Ze$, the S.
equation has the form, as shown in Section 10.1,

$$\nabla^2\phi + 2\left(\lambda + \frac{\beta Z}{\sigma}\right)\phi = 0. \tag{15}$$

This leads to the eigenvalue

$$\lambda = -\frac{\beta^2Z^2}{2},$$

and the eigenfunction

$$\phi = \left(\frac{\beta^3Z^3}{\pi}\right)^{1/2}e^{-\beta Z\sigma}. \tag{16}$$



Thus, the suggestion of a solution of the form indicated in equation
(13) may be interpreted as follows:

We regard the repulsive interaction of the electrons as equivalent in
effect to a decreased force of attraction between each electron and the
nucleus. That is, instead of regarding each electron as acted upon by a
nuclear charge of magnitude Ze, we replace the latter by an effective
charge of magnitude $\beta Ze$, where $\beta < 1$. From this point of view
we may also consider that each electron screens the other electron to
some extent and hence we may designate $(1 - \beta)$ as the "screening
constant." 8

The application of the principle of minimum energy leads to the
relation

$$\int \phi H\phi\,d\tau = \text{minimum} = \lambda\int \phi^2\,d\tau. \tag{17}$$

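For $\lambda$ expressed in Hartree units, the operator H is defined, in consequence
of equation (12), by the relation (see Supplementary Note)

$$H = -\tfrac{1}{2}\left(\nabla_1^2 + \nabla_2^2\right) - Z\left(\frac{1}{\sigma_1} + \frac{1}{\sigma_2}\right) + \frac{1}{\sigma_{12}}.$$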


8 C. Eckart, Phys. Rev., 36, 878 (1930). 
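Hence,

$$H\phi = -\tfrac{1}{2}\left(\nabla_1^2 + \nabla_2^2\right)\phi - \left[Z\left(\frac{1}{\sigma_1} + \frac{1}{\sigma_2}\right) - \frac{1}{\sigma_{12}}\right]\phi.$$

But from equation (15) it is seen that

$$-\tfrac{1}{2}\nabla_1^2\phi = \left(-\frac{\beta^2Z^2}{2} + \frac{\beta Z}{\sigma_1}\right)\phi,$$

and a similar relation applies to $\nabla_2^2\phi$.
Therefore,

$$H\phi = \left[-\beta^2Z^2 + (\beta - 1)Z\left(\frac{1}{\sigma_1} + \frac{1}{\sigma_2}\right) + \frac{1}{\sigma_{12}}\right]\phi,$$

and

$$\int \phi H\phi\,d\tau = J_1 + J_2 + J_3,$$

where

$$J_1 = -\beta^2Z^2\int \phi^2\,d\tau, \qquad J_2 = (\beta - 1)Z\int \phi^2\left(\frac{1}{\sigma_1} + \frac{1}{\sigma_2}\right)d\tau, \qquad J_3 = \int \frac{\phi^2}{\sigma_{12}}\,d\tau.$$

Since the functions are normalized,

$$J_1 = -\beta^2Z^2.$$

Since the integral $J_2$ must be symmetrical in $\sigma_1$ and $\sigma_2$,

$$J_2 = 2(\beta - 1)Z\int \frac{\phi^2}{\sigma_1}\,d\tau = 2\beta Z\,(\beta - 1)Z = 2\beta(\beta - 1)Z^2.$$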
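Also, from equations (10.16) and (10.20) we obtain the relation

$$J_3 = \int \frac{\phi^2}{\sigma_{12}}\,d\tau = \frac{5}{8}\beta Z.$$

Hence,

$$\lambda\int \phi^2\,d\tau = \lambda = J_1 + J_2 + J_3 = Z^2\left\{-\beta^2 + 2\beta(\beta - 1) + \frac{5\beta}{8Z}\right\}.$$

For Z = 2,

$$J_3 = \frac{5}{4}\beta = \frac{5}{16}\beta Z^2,$$

and

$$\lambda = Z^2\left(\beta^2 - \frac{27}{16}\beta\right). \tag{20}$$

Equation (20) gives $\lambda$, the value of the energy in Hartree units, as a
function of the parameter $\beta$. This will be a minimum for the value of $\beta$
determined by the condition

$$\frac{d\lambda}{d\beta} = 0.$$

That is, $\beta = \frac{27}{32}$,

from which we obtain for the value of E, the energy in ordinary units,
the relation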









This represents the energy of formation of He from the two electrons
and a nucleus of charge Ze. Hence, the ionization potential of the
helium atom is given by

$$V_i = \frac{217}{512} \times 4 \times 13.53\ \text{volts} = 22.94\ \text{volts}.$$
It will be observed that this value is 1.53 volts less than the observed
value (24.47 volts), whereas the first-order perturbation relation leads
to a discrepancy of 4.17 volts.

The value $\beta = \frac{27}{32}$ means that the effective nuclear charge is $\frac{27}{16}$
in units of e (the charge on the electron), and that consequently the
interaction of the electrons is equivalent to a decrease of $\frac{5}{16}$ units in the
nuclear charge.
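The arithmetic of this section is easily retraced. The short sketch below evaluates $\lambda(\beta) = J_1 + J_2 + J_3$ in Hartree units, locates the minimum, and converts the result to an ionization potential with the value 13.53 volts used in the text. The function names are illustrative only, and the energy $-Z^2/2$ assumed for the remaining He+ ion is the hydrogen-like value for a single electron.

```python
RYDBERG_VOLTS = 13.53  # ionization potential of hydrogen, as used in the text

def helium_lambda(beta, Z=2):
    """lambda(beta) = J1 + J2 + J3 in Hartree units for the screened function."""
    J1 = -beta ** 2 * Z ** 2
    J2 = 2.0 * beta * (beta - 1.0) * Z ** 2
    J3 = 5.0 * beta * Z / 8.0
    return J1 + J2 + J3

def optimum_beta(Z=2):
    """Root of d(lambda)/d(beta) = 2*beta*Z^2 - 2*Z^2 + 5*Z/8 = 0."""
    return 1.0 - 5.0 / (16.0 * Z)

def ionization_potential(lam, Z=2):
    """Volts needed to pass from the atom (lambda) to the ion (-Z^2/2 Hartree)."""
    return (-(Z ** 2) / 2.0 - lam) * 2.0 * RYDBERG_VOLTS

beta = optimum_beta()
print(f"beta = {beta}")                                   # 27/32 = 0.84375
print(f"lambda = {helium_lambda(beta)}")                  # -(27/16)^2
print(f"V_i (variational)  = {ionization_potential(helium_lambda(beta)):.2f} volts")
print(f"V_i (beta = 1)     = {ionization_potential(helium_lambda(1.0)):.2f} volts")
```

Setting $\beta = 1$ reproduces the first-order perturbation value, which makes the improvement obtained from the single adjustable parameter directly visible.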

Instead of the form assumed for the function $\phi$ in the preceding
discussion it is obviously possible to find other forms which lead to 
minima that approach the true value more closely. A number of 
theoretical physicists have attacked this problem by more involved 
mathematical methods, 9 and of these the most successful results have 
been obtained by E. A. Hylleraas. 10 

The method used by him involves the choice of a system of coordinates 
which has also proved suitable for other problems in quantum 
mechanics. For this reason a summary is given in the following section 
of his technic. 

11.3 Variational Method of Hylleraas. In the following section
we shall replace $\sigma_1$, $\sigma_2$, and $\sigma_{12}$ by $r_1$, $r_2$, and $r_{12}$, respectively. In terms
of these coordinates the element of volume is derived as follows. Let
$r_1, \eta, \zeta$ denote the polar coordinates of the first electron, and $r_2, \theta, \chi$
those of the second referred to the vector $\mathbf{r}_1$ as polar axis. The angle
between $\mathbf{r}_1$ and $\mathbf{r}_2$ is given by $\theta$, and hence

$$r_{12}^2 = r_1^2 + r_2^2 - 2r_1r_2\cos\theta,$$

so that

$$r_{12}\,dr_{12} = r_1r_2\sin\theta\,d\theta.$$

The element of volume is given by

$$d\tau' = r_2^2\sin\theta\,dr_2\,d\theta\,d\chi \cdot r_1^2\sin\eta\,dr_1\,d\eta\,d\zeta = r_1r_2r_{12}\,dr_1\,dr_2\,dr_{12}\,d\chi\,\sin\eta\,d\eta\,d\zeta.$$

For spherically symmetrical distributions, we may replace $d\tau'$ by
the element of volume

$$d\tau = r_1r_2r_{12}\,dr_1\,dr_2\,dr_{12}\int_0^{\pi}\sin\eta\,d\eta\int_0^{2\pi}d\chi\int_0^{2\pi}d\zeta = 8\pi^2r_1r_2r_{12}\,dr_1\,dr_2\,dr_{12}. \tag{22}$$



9 A list of the variational functions used and of the values of E calculated by
each of these methods is given by L. Pauling and E. B. Wilson, "Introduction
to Quantum Mechanics," p. 224.

10 E. A. Hylleraas, Z. Physik, 64, 347 (1929); see also ibid., 48, 469 (1928);
65, 209 (1930).
Now let us introduce the so-called "elliptical" coordinates

$$s = r_1 + r_2, \tag{23}$$

$$t = r_1 - r_2, \tag{24}$$

and let

$$u = r_{12}. \tag{25}$$

Since $4r_1r_2 = s^2 - t^2$,

$$d\tau = 2\pi^2(s^2 - t^2)\,u\,du\,dr_1\,dr_2. \tag{26}$$

In order to express the element of area $dr_1\,dr_2$ in terms of ds dt we
make use of the following relation between these elements (which is
derived in any treatise on integral calculus): 11

$$ds\,dt = \left(\frac{\partial s}{\partial r_1}\frac{\partial t}{\partial r_2} - \frac{\partial s}{\partial r_2}\frac{\partial t}{\partial r_1}\right)dr_1\,dr_2. \tag{27}$$

From the functional relations between s, t, $r_1$, and $r_2$ in equations
(23) and (24) it is evident that

$$\frac{\partial s}{\partial r_1} = \frac{\partial t}{\partial r_1} = \frac{\partial s}{\partial r_2} = -\frac{\partial t}{\partial r_2} = 1.$$

Consequently (27) becomes, in absolute magnitude,

$$2\,dr_1\,dr_2 = ds\,dt,$$

and (26) becomes

$$d\tau = \pi^2(s^2 - t^2)\,u\,du\,ds\,dt. \tag{28}$$

The limits of integration are evidently

$$-u \leq t \leq u,$$
$$0 \leq u \leq s < \infty.$$
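A quick numerical check of the volume element (28) and of these limits is to integrate a function whose normalization integral is known. In the sketch below, which is an illustration rather than part of the original derivation, the function $e^{-2ks}$ (the square of $e^{-k(r_1 + r_2)}$) is integrated over s, t, and u by nested Simpson rules and compared with the product of the two single-electron integrals, $(\pi/k^3)^2$. The grid sizes are arbitrary choices.

```python
import math

def simpson(f, a, b, n):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def norm_integral(k, s_max_factor=40.0):
    """Integral of exp(-2ks) with d tau = pi^2 (s^2 - t^2) u du ds dt.

    Limits: -u <= t <= u and 0 <= u <= s; the s-range is cut off at
    s_max_factor / k, where the exponential is negligible.
    """
    def over_t(s, u):
        # The integrand is quadratic in t, so a single Simpson panel is exact.
        return simpson(lambda t: (s * s - t * t) * u, -u, u, 2)

    def over_u(s):
        return simpson(lambda u: over_t(s, u), 0.0, s, 20)

    s_max = s_max_factor / k
    return math.pi ** 2 * simpson(
        lambda s: math.exp(-2.0 * k * s) * over_u(s), 0.0, s_max, 800)

k = 27.0 / 16.0
print(norm_integral(k), (math.pi / k ** 3) ** 2)
```

The two printed numbers agree to within the quadrature error, which confirms both the factor $\pi^2(s^2 - t^2)u$ and the ranges of t and u.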

Let $\phi(s, t, u)$ denote some function of the coordinates. This will be a
solution of the S. equation (12) if the following condition is satisfied.

$$I = \frac{1}{N}\int \phi H\phi\,d\tau = \text{minimum} = \lambda,$$

where

$$N = \int \phi^2\,d\tau.$$

The Hamiltonian operator has the form

$$H = -\tfrac{1}{2}\left(\nabla_1^2 + \nabla_2^2\right) + V,$$

where

$$V = -\frac{Z}{r_1} - \frac{Z}{r_2} + \frac{1}{r_{12}}. \tag{29}$$

11 The expression in the parentheses is known as a Jacobian.

By application of Green's theorem, the expression

$$\phi H\phi = -\tfrac{1}{2}\phi\left(\nabla_1^2 + \nabla_2^2\right)\phi + V\phi^2 \tag{30}$$

can be transformed into a form more suitable for integration. This
theorem (see Appendix IV) states that, if U is a function of the rectangular
coordinates x, y, z, then

$$\iiint U\nabla^2U\,dx\,dy\,dz + \iiint\left[\left(\frac{\partial U}{\partial x}\right)^2 + \left(\frac{\partial U}{\partial y}\right)^2 + \left(\frac{\partial U}{\partial z}\right)^2\right]dx\,dy\,dz = \iint U\,\frac{\partial U}{\partial n}\,ds, \tag{31}$$

where $\partial U/\partial n$ is the rate of change of U along the normal at any point
on a surface described in the configuration space, and ds is an element of
area on this surface. If the integration indicated in the first two integrals
is carried out over the whole configuration space (that is, with the
limits $x = y = z = \pm\infty$), the values of U and $\partial U/\partial n$ on the surface
described at these limits must vanish, if U is a solution of the S. equation.
Also the integrand in the second term of this equation is evidently
equal to $(\partial U/\partial n)^2$, that is, to the square of the gradient along the
diagonal of the element of volume dxdydz. Hence, we can express
equation (31) in the form

$$\iiint U\nabla^2U\,dx\,dy\,dz = -\iiint (\operatorname{grad} U)^2\,dx\,dy\,dz, \tag{32}$$

where the limits of integration are $x = y = z = \pm\infty$.
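In the case of the function $\phi(r_1, r_2, r_{12})$,

$$r_1^2 = x_1^2 + y_1^2 + z_1^2,$$
$$r_2^2 = x_2^2 + y_2^2 + z_2^2,$$

and

$$r_{12}^2 = (x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2,$$

where $x_1, y_1, z_1$ are the coordinates of the first electron and $x_2, y_2, z_2$ refer
to the second electron. Then

$$\frac{\partial\phi}{\partial x_1} = \frac{\partial\phi}{\partial r_1}\frac{\partial r_1}{\partial x_1} + \frac{\partial\phi}{\partial r_{12}}\frac{\partial r_{12}}{\partial x_1} = \frac{\partial\phi}{\partial r_1}\,\frac{x_1}{r_1} + \frac{\partial\phi}{\partial r_{12}}\,\frac{x_1 - x_2}{r_{12}}.$$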
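Hence, for the first electron

$$(\operatorname{grad}_1\phi)^2 = \left(\frac{\partial\phi}{\partial r_1}\right)^2 + \left(\frac{\partial\phi}{\partial r_{12}}\right)^2 + \frac{r_1^2 - r_2^2 + r_{12}^2}{r_1r_{12}}\,\frac{\partial\phi}{\partial r_1}\,\frac{\partial\phi}{\partial r_{12}},$$

and a similar expression may be derived for $(\operatorname{grad}_2\phi)^2$, which refers
to the second electron.

In terms of the variables s, t, u,

$$\frac{\partial\phi}{\partial r_1} = \frac{\partial\phi}{\partial s}\frac{\partial s}{\partial r_1} + \frac{\partial\phi}{\partial t}\frac{\partial t}{\partial r_1} = \frac{\partial\phi}{\partial s} + \frac{\partial\phi}{\partial t}, \qquad \frac{\partial\phi}{\partial r_2} = \frac{\partial\phi}{\partial s} - \frac{\partial\phi}{\partial t}, \qquad \frac{\partial\phi}{\partial r_{12}} = \frac{\partial\phi}{\partial u}.$$

Hence,

$$(\operatorname{grad}_1\phi)^2 + (\operatorname{grad}_2\phi)^2 = 2\left[\left(\frac{\partial\phi}{\partial s}\right)^2 + \left(\frac{\partial\phi}{\partial t}\right)^2 + \left(\frac{\partial\phi}{\partial u}\right)^2\right] + \frac{4}{u(s^2 - t^2)}\,\frac{\partial\phi}{\partial u}\left\{s(u^2 - t^2)\frac{\partial\phi}{\partial s} + t(s^2 - u^2)\frac{\partial\phi}{\partial t}\right\}. \tag{33}$$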



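Combining equations (30) and (32) the integral to be minimized
has the form

$$I = \frac{\frac{1}{2}\int\left\{(\operatorname{grad}_1\phi)^2 + (\operatorname{grad}_2\phi)^2\right\}d\tau + \int V\phi^2\,d\tau}{\int \phi^2\,d\tau}, \tag{34}$$

where $d\tau = dx_1\,dy_1\,dz_1\,dx_2\,dy_2\,dz_2$.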
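According to equation (29), V is given by the relation

$$V = -\frac{Z}{r_1} - \frac{Z}{r_2} + \frac{1}{r_{12}} = -\frac{4Zs}{s^2 - t^2} + \frac{1}{u} = -\frac{4Zus - s^2 + t^2}{u(s^2 - t^2)}. \tag{35}$$

From equations (33), (34), and (35) it follows that the integral to be
minimized has the form

$$I = \frac{M - L}{N} = \lambda, \tag{36}$$

where $\lambda$ is the eigenvalue which appears in (12) and (17), and

$$M = \int_0^{\infty}ds\int_0^{s}du\int_0^{u}dt\left\{u(s^2 - t^2)\left[\left(\frac{\partial\phi}{\partial s}\right)^2 + \left(\frac{\partial\phi}{\partial t}\right)^2 + \left(\frac{\partial\phi}{\partial u}\right)^2\right] + 2\frac{\partial\phi}{\partial u}\left[s(u^2 - t^2)\frac{\partial\phi}{\partial s} + t(s^2 - u^2)\frac{\partial\phi}{\partial t}\right]\right\}, \tag{37}$$

$$L = \int_0^{\infty}ds\int_0^{s}du\int_0^{u}dt\;\phi^2\,(4Zus - s^2 + t^2), \tag{38}$$

and

$$N = \int_0^{\infty}ds\int_0^{s}du\int_0^{u}dt\;u(s^2 - t^2)\,\phi^2. \tag{39}$$

Owing to the fact that $\phi^2(s, t, u) = \phi^2(s, -t, u)$, and $\phi H\phi(s, t, u) =
\phi H\phi(s, -t, u)$, the limits of integration have been taken as
$0 \leq t \leq u \leq s \leq \infty$, and this leads to the elimination of a factor 2
in the expressions for M, L, and N.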

Equation (36) may be written in the form

$$\delta(M - L - N\lambda) = 0, \tag{40}$$

indicating that the expression in the brackets is to be a minimum.

We now have to consider different forms of the expression for $\phi$, and
the simplest form is that chosen in the previous section, viz.:

$$\phi = e^{-k(r_1 + r_2)} = e^{-ks}, \tag{41}$$

where, for the present, the normalization factor may be omitted, and k
is a parameter, the value of which has to be determined from the condition (40).
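Since $\phi$ is a function of s only,

$$N = \int_0^{\infty}\phi^2\,ds\int_0^{s}du\int_0^{u}u(s^2 - t^2)\,dt = \int_0^{\infty}\left(\frac{s^5}{3} - \frac{s^5}{15}\right)\phi^2\,ds = \frac{4}{15}\int_0^{\infty}s^5\phi^2\,ds,$$

while

$$L = \int_0^{\infty}\phi^2\,ds\int_0^{s}du\int_0^{u}(4Zus - s^2 + t^2)\,dt = \int_0^{\infty}\phi^2\,ds\int_0^{s}\left(4Zu^2s - s^2u + \frac{u^3}{3}\right)du = \int_0^{\infty}\left(\frac{4Z}{3} - \frac{5}{12}\right)s^4\phi^2\,ds.$$

Also,

$$\frac{\partial\phi}{\partial t} = \frac{\partial\phi}{\partial u} = 0,$$

and

$$M = \frac{4}{15}\int_0^{\infty}s^5\left(\frac{d\phi}{ds}\right)^2ds,$$

so that equation (40) reduces to

$$\delta\int_0^{\infty}\left[\frac{4s^5}{15}\left(\frac{d\phi}{ds}\right)^2 - \left(\frac{4Z}{3} - \frac{5}{12}\right)s^4\phi^2 - \frac{4\lambda}{15}s^5\phi^2\right]ds = 0. \tag{42}$$

The Euler equation which this variation problem must satisfy is

$$\frac{\partial F}{\partial\phi} - \frac{d}{ds}\left(\frac{\partial F}{\partial\phi_s}\right) = 0, \tag{43}$$

where F denotes the function in the large brackets in equation (42), and $\phi_s = d\phi/ds$.
Since

$$\frac{\partial F}{\partial\phi_s} = \frac{8s^5}{15}\,\frac{d\phi}{ds}, \qquad \frac{\partial F}{\partial\phi} = -2\left(\frac{4Z}{3} - \frac{5}{12} + \frac{4\lambda s}{15}\right)s^4\phi,$$

therefore the Euler condition (43) corresponds to the differential equation

$$\frac{8s^5}{15}\,\frac{d^2\phi}{ds^2} + \frac{8s^4}{3}\,\frac{d\phi}{ds} + 2s^4\left(\frac{4Z}{3} - \frac{5}{12} + \frac{4\lambda s}{15}\right)\phi = 0.$$

That is,

$$\frac{d^2\phi}{ds^2} + \frac{5}{s}\,\frac{d\phi}{ds} + \left(\frac{5Z}{s} - \frac{25}{16s} + \lambda\right)\phi = 0,$$

which is the corresponding S. equation.
Substituting for $\phi$ from equation (41), the result is

$$k^2 - \frac{5k}{s} + \frac{5Z}{s} - \frac{25}{16s} + \lambda = 0.$$

Since this relation must be valid for all values of s, we must have the
following two relations:

$$k = Z - \frac{5}{16} = \frac{27}{16}, \quad \text{since } Z = 2,$$

and

$$\lambda = -k^2 = -\left(\frac{27}{16}\right)^2.$$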

Hence, 



and 



$$\phi = c\,e^{-27s/16},$$



which is the same form as equation (14), except for the normalization 
factor. 
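The same result follows if the reduced integrals are inserted in (36) and the quotient is minimized numerically with respect to k. The sketch below assumes $\phi = e^{-ks}$, uses the elementary integral $\int_0^{\infty}s^ne^{-2ks}\,ds = n!/(2k)^{n+1}$, and applies a golden-section search; the search routine and its interval are illustrative choices, not part of Hylleraas's procedure.

```python
import math

def moment(n, k):
    """Integral of s^n * exp(-2ks) from 0 to infinity: n!/(2k)^(n+1)."""
    return math.factorial(n) / (2.0 * k) ** (n + 1)

def lam(k, Z=2):
    """(M - L)/N for phi = exp(-k s), built from the reduced integrals."""
    M = (4.0 / 15.0) * k * k * moment(5, k)       # (d phi/ds)^2 = k^2 phi^2
    L = (4.0 * Z / 3.0 - 5.0 / 12.0) * moment(4, k)
    N = (4.0 / 15.0) * moment(5, k)
    return (M - L) / N

def golden_min(f, lo, hi, tol=1e-10):
    """Golden-section search for the minimum of a unimodal function."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    c, d = hi - g * (hi - lo), lo + g * (hi - lo)
    fc, fd = f(c), f(d)
    while hi - lo > tol:
        if fc < fd:
            hi, d, fd = d, c, fc
            c = hi - g * (hi - lo)
            fc = f(c)
        else:
            lo, c, fc = c, d, fd
            d = lo + g * (hi - lo)
            fd = f(d)
    return 0.5 * (lo + hi)

k_best = golden_min(lam, 0.5, 3.0)
print(f"k = {k_best:.6f}, lambda = {lam(k_best):.6f}")
```

For Z = 2 the search settles at k close to 27/16 with $\lambda$ close to $-(27/16)^2$, in agreement with the two relations obtained from the differential equation.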

As mentioned previously, the value thus derived for $V_i$, the ionization
energy of the helium atom, is 1.53 volts less than the observed value. 
This difference is due to the fact that the assumed eigenfunction leads to 
a distribution function for each electron in which the repulsive force 
between the electrons is not taken into account to a sufficient extent. 
The electrons do not move independently, and actually there must exist 
a high degree of improbability for the simultaneous occurrence of the two 
electrons in adjacent regions in the three-dimensional space. This 
phenomenon gives rise to an energy term which E. Wigner and F. Seitz 
have designated as correlation energy. 

In order to take this into account it is necessary to use functions which
involve both t and u as variables. These functions should exhibit
minima in the region u = 0 and t very large. Thus the next approximation
used by Hylleraas has the form

$$\phi = e^{-ks}(1 + a_1u + a_2t^2),$$

where k, $a_1$, and $a_2$ are variable parameters which must satisfy the three
conditions

$$\frac{\partial\lambda}{\partial k} = \frac{\partial\lambda}{\partial a_1} = \frac{\partial\lambda}{\partial a_2} = 0.$$
This yields a value for $V_i$ which is only 0.033 volt less than the
observed value.

The next approximation is obtained by means of the function



which reduces the difference to 0.0115 volt. 

To obtain the most accurate approximation, Hylleraas has used the
function

$$\phi = e^{-ks}P(s, t, u),$$

where

$$P = \sum_{n,l,m}A_{n,2l,m}\,s^n\,t^{2l}\,u^m,$$

that is, where P is expressed as a series of terms in powers of s, t, and u.
Under these conditions it is necessary to solve a number of equations 
of the form 




By using a polynomial with fourteen terms, a value was deduced for
$V_i$ which is within 0.0016 volt of the spectroscopic value.

11.4 The Problem of the Many-Electron Atom. The discussion 
in the previous sections indicates that, even in the case of the two- 
electron system, the problem of solving the S. equation involves a great 
deal of tedious calculation, if results of any considerable degree of 
accuracy are required. It is obvious that the difficulties encountered 
in solving the corresponding equation in the case of the many-electron 
atom must be even greater. In the following section some of the more 
outstanding methods which have been developed for the solution of this 
very complex problem are described quite briefly. The reader who 
desires to follow up this aspect of quantum mechanics will find more 
complete details in the references given at the end of the chapter. 

In any atomic system we may divide the electrons into two classes: 
(1) those in " inner shells," and (2) those in the outer shell, or valence 
group. It is only the latter which we need to consider, in general, in 
calculating the energy of the system, or the mode of interaction of one 
atom with other atoms. 

For these electrons we can apply the variational principle in the form 



where E is the zero-order eigenvalue expressed in ordinary units.
The Hamiltonian operator for an atom with N valence electrons has the
form

$$H = -\frac{1}{\alpha^2}\sum_{i=1}^{N}\nabla_i^2 - \sum_{i=1}^{N}\frac{Ze^2}{r_i} + {\sum_{i,j}}'\,\frac{e^2}{r_{ij}}, \tag{44}$$

where $\alpha^2 = 8\pi^2\mu/h^2$, and the prime over the last summation indicates
that the combination i, j is taken only for values of j > i.

The number of terms in this summation is equal to the number of
combinations of N things, taken two at a time, that is,

$$\frac{N!}{(N - 2)!\,2!} = \frac{N(N - 1)}{2}.$$

For the case N = 2, equation (44) evidently becomes identical with
equation (10.10a). If the "perturbation" terms in $1/r_{ij}$ were absent,
the solution would be the product of N eigenfunctions, one for each 
electron. Hence, the important problem in the solution of equation 
(44) is that of taking into account the repulsive forces between the 
electrons. In order to realize better the conditions which must be 
satisfied by any distribution function which shall adequately represent 
the behavior of the electrons in the system, it is necessary to consider 
more fully the significance of the terms in the expression for H. 
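The count of repulsion terms mentioned above grows quadratically with the number of electrons, which is one measure of how quickly the problem becomes laborious. A minimal sketch that lists the index pairs with j > i and compares their number with N!/((N - 2)! 2!) follows; the function name is illustrative only.

```python
from itertools import combinations
from math import comb, factorial

def repulsion_pairs(n_electrons):
    """All index pairs (i, j) with j > i, one for each 1/r_ij term."""
    return list(combinations(range(1, n_electrons + 1), 2))

for n in (2, 3, 4, 10):
    pairs = repulsion_pairs(n)
    by_formula = factorial(n) // (factorial(n - 2) * factorial(2))
    print(n, len(pairs), by_formula, comb(n, 2))
```

For N = 2 the list contains the single pair (1, 2), which corresponds to the one repulsion term of the helium problem.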

This may be separated into two parts: one corresponding to the 
kinetic energies of the particles, and the other representing the potential 
energy function. These are given by the relations 

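$$T = -\frac{1}{\alpha^2}\sum_i\nabla_i^2, \tag{45}$$

and

$$V = \sum_iV_i + {\sum_{i,j}}'\,\frac{e^2}{r_{ij}}, \tag{46}$$

where $V_i = -Ze^2/r_i$.

As in section 7.8, the mean value of the kinetic energy may be written
in the form

$$\bar{T} = \frac{1}{\alpha^2}\sum_i\int |\operatorname{grad}_i\phi|^2\,d\tau, \tag{47}$$

and the mean value of V is given by

$$\bar{V} = \int |\phi|^2\,V\,d\tau. \tag{48}$$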
The latter integral may be regarded as the classical potential energy
of the charge distribution $\rho = |\phi|^2$ in the potential field V in the
configuration space of 3N dimensions.

Since the integral in (48) is negative in the case of stable states of 
atomic systems, the mean value of V is always negative, while that of 
T is positive. In order to obtain a minimum value (that is, a maximum 
negative value) for E, the total energy, it is therefore necessary to 
obtain such a representation for $|\phi|^2$ as will give a minimum value
for T and maximum negative value for V.

Now we can make T very small, by choosing such a form for $\phi$ as
will make each of the functions $|\operatorname{grad}_i\phi|^2$ quite small. This means
that $\phi$ changes only slowly with variation in the coordinate variables,
and therefore the distribution is spread out over a large region. The
same conclusion is also evident from the point of view of the Principle
of Indeterminacy. A high degree of uncertainty in the position of a
particle corresponds to a fairly definite knowledge of the momentum
of the particle. But such a distribution will overemphasize the effect
of large values of $r_i$ in the expression for V, with the result that the latter
will not have as large a negative value as it should have. Consequently
E will not be as negative as possible.

On the other hand, if the electrons are localized to any extent, this
must correspond to very high rates of change in $\phi$ with variation in the
coordinates. That is, T is increased enormously, while V is made
highly negative by letting $|\phi|^2$ have very large values in regions where
$r_i$ is small. This conclusion also follows from a consideration of the
Principle of Indeterminacy, since precise knowledge of position can 
occur only if the momentum can vary over an extremely large range. 
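The opposing tendencies of T and V can be made concrete with the simplest possible case. The sketch below is restricted, as an assumption made for illustration, to a single electron in the field of a charge Z with the trial function $e^{-kr}$ in Hartree units; for this function the mean values are $\bar{T} = k^2/2$, $\bar{V} = -Zk$, and the mean radius is $3/(2k)$. Small k corresponds to a spread-out distribution, large k to a localized one.

```python
def hydrogenlike_means(k, Z=1.0):
    """Mean T, V, E and mean radius for phi ~ exp(-k r), one electron.

    Hartree units; a single-electron stand-in for the general argument.
    """
    T = 0.5 * k * k      # rises rapidly as the distribution is localized
    V = -Z * k           # becomes more negative as the distribution is localized
    r_mean = 1.5 / k     # measure of the spread of the distribution
    return T, V, T + V, r_mean

print("   k    <r>      T       V       E")
for k in (0.25, 0.5, 1.0, 2.0, 4.0):
    T, V, E, r = hydrogenlike_means(k)
    print(f"{k:5.2f} {r:6.2f} {T:7.3f} {V:7.3f} {E:7.3f}")
```

The spread-out functions keep T small but sacrifice potential energy, the localized ones gain potential energy at a much greater cost in T, and the lowest total energy lies between the two extremes, at k = Z.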

When we consider the effect of the $1/r_{ij}$ terms, it is evident from
equation (46) that, if the electrons are localized in adjacent regions
(for which $r_{ij}$ is very small), the resulting value of V is made more
positive, and E must therefore become more positive. Consequently,
it is necessary to choose such a form for $\phi$ as will make the terms
involving $1/r_{ij}$ as small as possible. That is, the best form of distribution
function will be that which represents each electron as tending to avoid 
the particular region in which any other electron is present. 

These remarks are equally valid in the quantum mechanics considera- 
tion of the problems of molecule formation and of the solid state. It is 
the necessity for satisfying apparently opposing conditions that makes 
the actual solution of the S. equation difficult in most of the cases which 
are of practical interest. 

Of the methods which have been developed for treating the problem 
of the many-electron atom, that of the self-consistent field developed by 
D. R. Hartree 12 and that involving antisymmetrical functions utilized by
J. C. Slater 13 are the most important. 

11.5 The Method of the Self-Consistent Field. The eigenfunction 
for the atomic system is represented as a product of single-electron 
eigenfunctions. These are obtained by a method which may be de- 
scribed best by considering its application in the case of the helium 
atom. 14 

Let us assume that the potential energy function can be expressed as
the sum of two terms in the form

$$V = V_1 + V_2,$$

where $V_1$ is a function of $r_1$, $\theta_1$, and $\eta_1$, the coordinates of one electron,
and $V_2$ is a function of the coordinates of the second electron. From
a knowledge of these functions it should be possible to determine the
eigenfunction for the system as the product of two single-electron
functions $\phi_1(r_1)$ and $\phi_2(r_2)$. But, in that case, the potential field
effective for electron 1 is given (in Hartree units) by

$$V_1(r_1) = -\frac{Z}{r_1} + \int \frac{|\phi_2(r_2)|^2}{r_{12}}\,d\tau_2, \tag{49}$$

where the first term represents the attraction due to the nucleus, and the
second term, the repulsion due to the averaged charge distribution for
electron 2.

That is, it is possible to calculate $V_1(r_1)$ by utilizing a plausible form
for $\phi_2(r_2)$. But, $V_1(r_1)$ having been determined, it is possible to solve
the S. equation

$$\nabla_1^2\phi_1 + 2(\lambda_1 - V_1)\phi_1 = 0 \tag{50}$$

for the function $\phi_1(r_1)$. This can then be inserted in an equation
similar to (49) for the determination of $V_2(r_2)$, and gives the average
potential field effective for electron 2. The function thus obtained
can be inserted in a S. equation similar to (50) for the determination
of $\phi_2(r_2)$. The form of the function derived by this procedure should
agree with that assumed in solving equation (49). From the extent
to which this agreement is actually obtained by a first trial form for
$\phi_2(r_2)$, it is possible to decide upon an improved form for the function,
and the procedure is then repeated with the latter. This explains the

12 D. R. Hartree, Proc. Cambridge Phil. Soc., 24, 89, 111, 426 (1928), and
subsequent papers in Proc. Roy. Soc. (London), A.

13 J. C. Slater, Phys. Rev., 34, 1293 (1929). 

14 This illustration is given by N. F. Mott, " Wave Mechanics," p. 120. See 
also H. Bethe, " Handbuch der Physik," XXIV/1, pp. 368-371. 
reason for designating Hartree's method as that of the self-consistent
field.
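The cycle of assuming a function for one electron, solving for the other, and repeating until the two agree can be imitated in a very restricted form. The sketch below is not Hartree's numerical procedure: it assumes, for illustration only, that each single-electron function is confined to the family $e^{-ar}$, so that "solving" for electron 1 in the averaged field of electron 2 (exponent b) reduces to minimizing $a^2/2 - Za + J(a, b)$ over a, where $J(a, b) = ab(a^2 + 3ab + b^2)/(a + b)^3$ is the mean repulsion between the two charge clouds in Hartree units.

```python
import math

def repulsion(a, b):
    """Mean repulsion between normalized charge clouds exp(-2ar) and exp(-2br)."""
    return a * b * (a * a + 3.0 * a * b + b * b) / (a + b) ** 3

def one_electron_energy(a, b, Z=2.0):
    """Energy of electron 1 (exponent a) in the nuclear field plus the
    averaged field of electron 2 (exponent b), in Hartree units."""
    return 0.5 * a * a - Z * a + repulsion(a, b)

def golden_min(f, lo, hi, tol=1e-10):
    """Golden-section search for the minimum of a unimodal function."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    c, d = hi - g * (hi - lo), lo + g * (hi - lo)
    fc, fd = f(c), f(d)
    while hi - lo > tol:
        if fc < fd:
            hi, d, fd = d, c, fc
            c = hi - g * (hi - lo)
            fc = f(c)
        else:
            lo, c, fc = c, d, fd
            d = lo + g * (hi - lo)
            fd = f(d)
    return 0.5 * (lo + hi)

def self_consistent_exponent(Z=2.0, start=2.0, tol=1e-6, max_cycles=50):
    """Repeat: fix b, find the best a, then set b = a, until they agree."""
    b = start
    for cycle in range(1, max_cycles + 1):
        a = golden_min(lambda x: one_electron_energy(x, b, Z), 0.2, 2.0 * Z)
        if abs(a - b) < tol:
            return a, cycle
        b = a
    raise RuntimeError("self-consistency was not reached")

a_scf, cycles = self_consistent_exponent()
print(f"self-consistent exponent = {a_scf:.5f} after {cycles} cycles")
```

Within this restricted family the self-consistent exponent for Z = 2 comes out close to 27/16, the value obtained in Section 11.2, which illustrates the connection between the self-consistent field and the variational principle described in the next paragraph.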

V. Fock has shown 15 that if, in the general case, a product function

$$\phi = \phi_1(r_1)\,\phi_2(r_2) \ldots \phi_N(r_N)$$

is chosen and the functions $\phi_1(r_1)$, etc., are varied individually to
obtain the minimum value of the variational integral $\int \phi H\phi\,d\tau$, then
the same single-electron functions are derived as by the application of
Hartree's method.

J. C. Slater 16 has also examined Hartree's method critically as to its 
accuracy as a method of solving the S. equation and has pointed out the 
conditions under which the eigenfunctions and energy values thus de- 
duced might deviate to a considerable extent from the correct values. 
However, he has also shown 17 that in the case of the normal helium atom 
the method yields an electron distribution function which agrees well 
with that derived independently by Hartree and an ionization energy 
which is within 1 per cent of the observed value. Furthermore, Hartree 
and his associates have applied the method to calculate distribution 
functions for electrons in more complex atoms and have obtained results 
which are in satisfactory agreement with deductions from observations 
on the scattering of X-rays and of electrons by atoms. 

11.6 Slater's Treatment of the Many-Electron Problem. In accord- 
ance with Pauli's Exclusion Principle the complete eigenfunction for any 
atomic or molecular system must be antisymmetric. Consequently 
Hartree's function is not quite satisfactory. However, Slater has 
shown that an antisymmetric function may be built up out of single- 
electron functions in the following manner. 

Let $\phi_{n_i}(x_i, y_i, z_i)$ denote the eigenfunction for the electron having the
quantum numbers $n_i$, $l_i$, $m_i$ and the coordinates of position $x_i$, $y_i$, $z_i$.
The spin function will be designated by $a_i(\omega_i)$, where $\omega_i$ may have the
values $\pm(\frac{1}{2})h/(2\pi)$. The complete eigenfunction is given by

$$u(n_i/x_i) = \phi_{n_i}(x_i, y_i, z_i)\,a_i(\omega_i),$$

where $n_i$ designates the four quantum numbers n, l, m, and $m_s$, and
$x_i$ designates the four coordinates, three of position and one of spin.
A Hartree function would be obtained by taking the product of similar
functions for each of the N electrons in the system. This would have the
form

$$u(n_1/x_1)\,u(n_2/x_2) \cdots u(n_N/x_N),$$

and it satisfies the S. equation approximately. This, however, does not
satisfy the exclusion principle. As Slater points out, "To build
up an antisymmetric solution, we first note that we still have an
approximate solution, connected with the same energy value, if we
interchange any two x's, obtaining for example
$u(n_1/x_2)\,u(n_2/x_1) \cdots u(n_N/x_N)$. We still have an approximation with
the same energy if we make a linear combination of any such solutions.
Then we can make the one possible combination which is antisymmetric, and
it will both satisfy the exclusion principle, and will be an approximate
solution of Schrödinger's equation."

15 Z. Physik, 61, 126 (1930). See also Pauling and Wilson, "Quantum
Mechanics," p. 252.

16 Phys. Rev., 32, 339 (1928).

17 Phys. Rev., 32, 349 (1928).

This combination is written by Slater in the form of the determinant: 

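$$\phi_0 = \frac{1}{\sqrt{N!}}
\begin{vmatrix}
u(n_1/x_1) & u(n_1/x_2) & \cdots & u(n_1/x_N) \\
u(n_2/x_1) & u(n_2/x_2) & \cdots & u(n_2/x_N) \\
\vdots & \vdots & & \vdots \\
u(n_N/x_1) & u(n_N/x_2) & \cdots & u(n_N/x_N)
\end{vmatrix}, \qquad (51)$$

where $1/\sqrt{N!}$ is the normalizing factor, if each of the individual
functions is normalized.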

It is obviously antisymmetric, for interchanging, say, $x_1$ and $x_2$
interchanges two columns of the determinant, which by a familiar
property merely changes the sign. It can be shown that it is the only
antisymmetric combination of these functions, and it leads at once to
the familiar interpretation of the exclusion principle. For, if two of the
functions had the same quantum numbers (say $n_1 = n_2$, symbolizing
equality of the four quantum numbers), then the corresponding rows
of the determinant would be identical [since they contain the functions
$u(n_1/x_1) = u(n_2/x_1)$, $u(n_1/x_2) = u(n_2/x_2)$, etc.], and by another
familiar rule the determinant would vanish. Thus, there is no solution
corresponding to the case where two electrons have the same set of quantum
numbers. Further, the determinant treats all electrons alike; hence we
cannot count as separate two states which differ only by the interchange
of the quantum numbers of two electrons. Our exclusion
principle then coincides with the one previously described.

The Slater determinant can also be written in the form 

$$\phi_0 = \frac{1}{\sqrt{N!}} \sum_P (-1)^P\, P\, u(n_1/x_1)\,u(n_2/x_2) \cdots u(n_N/x_N), \qquad (52)$$

in which P denotes any permutation of the electron coordinates, and
$(-1)^P = +1$ when P is even, that is, is equivalent to an even number
of interchanges, while $(-1)^P = -1$ when P is odd.
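
The permutation sum of equation (52) can be tried out numerically. The
following is only a minimal sketch under stated assumptions: the three
single-electron functions are hypothetical, unnormalized, one-dimensional
functions chosen for illustration, and a single number stands in for the
four coordinates $x_i$ (spin is left out). The helper names are likewise
illustrative and do not come from the text.

```python
import itertools
import math


def permutation_sign(perm):
    """Return +1 for an even permutation and -1 for an odd one."""
    perm = list(perm)
    sign = 1
    for i in range(len(perm)):
        # Each swap below is one interchange; count them until perm is sorted.
        while perm[i] != i:
            j = perm[i]
            perm[i], perm[j] = perm[j], perm[i]
            sign = -sign
    return sign


def slater(orbitals, coords):
    """Antisymmetrized product of equation (52): sum over coordinate permutations."""
    n = len(orbitals)
    total = 0.0
    for perm in itertools.permutations(range(n)):
        term = 1.0
        for k in range(n):
            term *= orbitals[k](coords[perm[k]])
        total += permutation_sign(perm) * term
    return total / math.sqrt(math.factorial(n))


# Hypothetical single-electron functions (assumption, for illustration only).
def u1(x):
    return math.exp(-x * x)


def u2(x):
    return x * math.exp(-x * x)


def u3(x):
    return (2.0 * x * x - 1.0) * math.exp(-x * x)


coords = [0.3, -0.7, 1.1]
swapped = [-0.7, 0.3, 1.1]          # coordinates of electrons 1 and 2 interchanged

value = slater([u1, u2, u3], coords)
value_swapped = slater([u1, u2, u3], swapped)
value_repeated = slater([u1, u1, u3], coords)   # two identical sets of quantum numbers
```

Here `value_swapped` is the negative of `value`, and `value_repeated`
vanishes, which are the two properties of the determinant used in the
argument above.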

As Pauling and Wilson observe, 18 "The function $\phi_0$ takes care of the
degeneracy due to the $N!$ possible distributions of the N electrons in a
fixed set of N functions, u. There still remains another type of
degeneracy, due to the possibility of there being more than one set of
spin-orbit functions corresponding to the same unperturbed energy." 
For these cases, a Slater determinant has to be set up for each set of 
spin coordinates, and the complete eigenfunction will be represented 
by the sum of two or more Slater determinants. The energy values are 
then obtained by solving a secular equation. 

Returning to the case in which only one Slater determinant is required,
the energy is given by the relation

$$E = \iint \phi_0 H \phi_0\, d\tau\, d\omega, \qquad (53)$$

where $d\tau$ is the element of configuration space and $d\omega$ that of spin space.
Assuming no interaction between spin and orbital functions, the last
integral can be written as the product of two integrals, one involving
$d\tau$, the other $d\omega$. Let $\alpha_i(\omega_i)$ denote the function for one direction of
spin, and $\beta_i(\omega_i)$ that for the opposite direction. Then we have the
relations

$$\int \alpha_i^2\, d\omega = \int \beta_i^2\, d\omega = 1, \qquad \int \alpha_i \beta_i\, d\omega = 0. \qquad (54)$$

In the case of the integral in (53) it is evident, since H does not
operate on the spin functions, that the spin functions will occur as
$\alpha_i^2$ or $\beta_i^2$. Consequently, the integral involving $d\omega$ is equal to unity,
and we need to consider only the integral over the configuration space.

Now let us consider the integral $\int \phi_0 H \phi_0\, d\tau$. This may also be written
in the form

i r r 

a J J 

Since each eigenfunction is the solution of a S. equation of the form

$$\tfrac{1}{2}\nabla^2 u_i + (E_i - V_i)\,u_i = 0,$$

the first integral in (55) can be reduced to the sum of a series of integrals
each of which is equal to $E_i - \int u_i^2 V_i\, d\tau_i$.

18 "Introduction to Quantum Mechanics," p. 233.

The second integral in (55) will consist of two types of terms of
which the first has the form

$$J_{12} = \iint \frac{u^2(n_1/x_1)\,u^2(n_2/x_2)}{r_{12}}\, d\tau_1\, d\tau_2, \qquad (56)$$

and the second has the form

$$K_{12} = \iint \frac{u(n_1/x_1)\,u(n_2/x_2)\,u(n_1/x_2)\,u(n_2/x_1)}{r_{12}}\, d\tau_1\, d\tau_2. \qquad (57)$$

The integral $J_{12}$ represents the Coulomb interaction, while $K_{12}$
corresponds to exchange interaction. It is evident that these exchange
integrals are not due to any new type of interaction between the
electrons, but arise from the fact that $\phi_0$ is expressed in the form of a
determinant.

As mentioned already, the degenerate case requires, for the complete
solution, a number of determinantal functions, each corresponding to
a definite set of spin functions. Let $\phi_\alpha$, $\phi_\beta$, . . . designate these
functions. It is then necessary to solve a secular equation of which the
matrix elements have the form

$$\int \phi_\alpha H \phi_\beta\, d\tau - \delta_{\alpha\beta} E,$$

where $\delta_{\alpha\beta}$ = 1 or 0.

The actual computation of the energy levels in any given case is quite 
tedious, and, in view of the fact that complete details and illustra- 
tions of such calculations are given both by Slater and by Pauling and 
Wilson, it has not been considered necessary to discuss this topic at 
further length in the present connection. Moreover, there will be given 
in a subsequent chapter a calculation, based on the same methods, of 
the energy of interaction of two or more atoms, which will also serve to 
illustrate the principles involved. 

In concluding this brief summary of the methods which have been 
used for solving the problems of many-electron atoms mention should 
be made of the modification of Slater's method introduced by V. Fock, 19 
and of the publications by J. E. Lennard-Jones 20 in which the methods 
of the perturbation theory are applied. 

19 Z. Physik, 61, 126 (1930).

20 Proc. Roy. Soc. (London), A129, 598 (1930), and Proc. Cambridge Phil. Soc.,
27, 469 (1931).

In a more recent paper by P. M. Morse, L. A. Young, and E. S.
Haurwitz 21 the variational method has been applied to calculate energy
levels for a number of the simple atoms (He, Li, Be, B, C, N, O, F, Ne,
Na, and Mg). The forms of single-electron functions used are as
follows:



2p. 



The constant A is fixed so that $u_2$ is orthogonal to $u_1$. Its value is
given in terms of the two parameters a and b, and N is a normalizing
factor which is also a function of a and b. "The parameter $\mu$ is a scale
factor, whose best value can be determined analytically, leaving but three
parameters to be determined numerically." These parameters a, b,
and c are then determined from the conditions

$$\frac{\partial E}{\partial a} = \frac{\partial E}{\partial b} = \frac{\partial E}{\partial c} = 0,$$

which lead to minimum values for E.

Tables are given in the original paper for the calculation of the cor- 
responding Coulomb and exchange terms, thus facilitating the cal- 
culation for any given electron configuration, when spin degeneracy is 
not taken into account. For calculating multiplet levels it is therefore 
necessary to form linear combinations of Slater determinants as men- 
tioned in a previous section. 

21 Phys. Rev., 48, 948 (1935).



SUPPLEMENTARY NOTE 1 
THE EXPRESSION FOR THE OPERATOR H IN ATOMIC UNITS 

For an atomic system of nuclear charge Z and N valence electrons, 
the S. equation in terms of atomic units has the form, which is analogous 
to equation (12), 



where the summation symbols have the same significance as in equa- 
tion (44). 

Since the eigenvalue $\lambda$ for this system must satisfy the condition

$$\int \phi H \phi\, d\tau = \lambda \int \phi\phi\, d\tau,$$



it follows that the Hamiltonian operator is given, in terms of atomic 
units, by the relation 



Thus the introduction of the factor $\tfrac{1}{2}$ in equation (30) is due to the
particular choice of units in which the energy is expressed.

COLLATERAL READING 

1. For discussion of the variational method see the following: 

BETHE, H., "Handbuch der Physik," XXIV/1, p. 354.
PAULING, L., and WILSON, E. B., "Introduction to Quantum Mechanics,"
Chapter VII.

2. The problem of the many-electron atom:

PAULING, L., and WILSON, E. B., Chapter IX.
SLATER, J. C., Phys. Rev., 34, 1293 (1929).
CHAPTER XII
THE HYDROGEN MOLECULE 

12.1 Heitler-London Method. The problem of determining the 
solution of the S. equation for the helium atom is a limiting case of 
the problem which is presented by the hydrogen molecule. If in the 
latter we permit the two nuclei to approach until they coalesce, we ob- 
viously obtain the same system as that of the helium atom. Conse- 
quently, we expect to find in the solution of the S. equation for the 
hydrogen molecule certain similarities with the results obtained in the 
previous chapter. However, there is one feature of the hydrogen mole- 
cule which is of extreme importance, and which does not occur in the 
case of the helium atom. This concerns the interpretation of the 
valence bond, which, on the basis of the Lewis-Langmuir theory, is 
regarded as due to the sharing of two electrons between the two hydro- 
gen nuclei. The first successful application of quantum mechanics 
to the solution of the problem of the hydrogen molecule was made by 
W. Heitler and F. London, 1 and although subsequent investigators
have developed methods of attacking the problem by which more accurate
results have been obtained, we shall find it advantageous to discuss the
Heitler-London (HL) theory in some detail before describing some of the
other methods.

We consider a system consisting of two nuclei A and B, and two
electrons 1 and 2, as indicated in Fig. 54. In the unperturbed state,
where the two atoms are quite separate, it is obvious that $E = 2E_0$, where
$E_0$ is the energy of the hydrogen atom in the normal state, as given in
Chapter VII. For electron 1 attached to nucleus A and electron 2 to
nucleus B, the eigenfunction is given by

$$\phi_1 = u_A(1)\,u_B(2), \qquad (1)$$

where

$$u_A(1) = \frac{1}{\sqrt{\pi a_0^3}}\,e^{-r_{A1}/a_0}, \qquad u_B(2) = \frac{1}{\sqrt{\pi a_0^3}}\,e^{-r_{B2}/a_0}, \qquad (2)$$

and $r_{A1}$ and $r_{B2}$ refer to distances indicated in Fig. 54.

FIG. 54. Illustrating notation used in formulating potential energy function
for interaction of two hydrogen atoms.

1 Heitler and London, Z. Physik, 44, 455 (1927).

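But corresponding to the same energy $2E_0$, it is also possible to have
the eigenfunction obtained by interchanging the electrons, that is,

$$\phi_2 = u_A(2)\,u_B(1), \qquad (3)$$

where

$$u_A(2) = \frac{1}{\sqrt{\pi a_0^3}}\,e^{-r_{A2}/a_0}, \qquad u_B(1) = \frac{1}{\sqrt{\pi a_0^3}}\,e^{-r_{B1}/a_0}. \qquad (4)$$

Consequently, the system is degenerate and the zero-order eigenfunction
should be represented, as in the case of the excited helium atom,
by the two linear combinations

$$\phi_\alpha^0 = a\phi_1 + b\phi_2, \qquad (5)$$

$$\phi_\beta^0 = c\phi_1 + d\phi_2, \qquad (6)$$

where a, b, c, and d are constants.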

The eigenfunctions representing to first-order perturbation terms, 
the perturbed state, that is, the state in which the two atoms are inter- 
acting, will be given by 

and 

<fo ~ </> + fo, (8) 



while the corresponding eigenvalues will be

$$E_\alpha = 2E_0 + \eta_\alpha \qquad (9)$$

and

$$E_\beta = 2E_0 + \eta_\beta. \qquad (10)$$



These will represent the eigenfunctions and eigenvalues corresponding 
to the S. equation : 



or 



(lib) 
where the operator H is defined by the relation

and 



(12) 



e 2 e 2 



(13) 



ri FAI r B2 rsi * r A 

The suffixes on the Laplacian operators refer to electrons 1 and 2. 
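Since $\phi_\alpha^0$ and $\phi_\beta^0$ must be orthonormalized functions, it follows that

$$\int (\phi_\alpha^0)^2\, d\tau = \int (\phi_\beta^0)^2\, d\tau = 1 \qquad (14)$$

and

$$\int \phi_\alpha^0\, \phi_\beta^0\, d\tau = 0, \qquad (15)$$

where the integration is carried out over the whole configuration space
for each electron.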

By substituting in these last two equations from equations (5) and (6),
it follows that

$$a^2 + b^2 + 2abS^2 = 1, \qquad (16)$$

$$c^2 + d^2 + 2cdS^2 = 1, \qquad (17)$$

$$ac + bd + (ad + bc)S^2 = 0, \qquad (18)$$

where

$$S^2 = \iint u_A(1)\,u_B(2)\,u_A(2)\,u_B(1)\, dv_1\, dv_2. \qquad (19)$$



Now it is evident that for $R = \infty$, that is, for the unperturbed state,
$S = 0$, since either $\phi_1$ or $\phi_2$ represents the two separate atoms. On the
other hand, for $R = 0$, as mentioned already, the system becomes identical
with that of the normal helium atom, for which

$$S = 1.$$

Hence, in the case of the hydrogen molecule, $1 > S > 0$, and since
in the case of helium, the zero-order eigenfunctions for the excited 
states are 



and 



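we assume $a = b$. The validity of this assumption will be justified by
the resulting deductions.
From equation (16) it follows that

$$a = b = \frac{1}{\sqrt{2 + 2S^2}}. \qquad (20)$$

Also, from equation (18), it follows that

$$b(c + d)(1 + S^2) = 0.$$

Since $b \neq 0$ and $S^2 \neq -1$, $c = -d$.

Hence, by substituting in equation (17), we deduce the result

$$c = \frac{1}{\sqrt{2 - 2S^2}}, \qquad d = \frac{-1}{\sqrt{2 - 2S^2}}, \qquad (21)$$

so that the orthonormalized functions are as follows:

$$\phi_\alpha^0 = \frac{\phi_1 + \phi_2}{\sqrt{2 + 2S^2}}, \qquad (22)$$

$$\phi_\beta^0 = \frac{\phi_1 - \phi_2}{\sqrt{2 - 2S^2}}. \qquad (23)$$

It is seen that $\phi_\alpha^0$ is a symmetrical, and $\phi_\beta^0$ an antisymmetrical, function.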

By direct substitution of these relations in equations (14) and (15) 
it is readily shown that the latter conditions are satisfied, so that equa- 
tions (22) and (23) represent orthogonal and normalized functions. 
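
The direct substitution can also be carried out numerically. The sketch
below takes the coefficients as $a = b = 1/\sqrt{2 + 2S^2}$ and
$c = -d = 1/\sqrt{2 - 2S^2}$ (the values that follow from conditions (16)
to (18) when $a = b$) and evaluates the left-hand sides of those three
conditions for a given $S^2$. The function names are illustrative only.

```python
import math


def hl_coefficients(S2):
    """Coefficients a, b, c, d of the symmetric and antisymmetric combinations."""
    a = b = 1.0 / math.sqrt(2.0 + 2.0 * S2)
    c = 1.0 / math.sqrt(2.0 - 2.0 * S2)
    d = -c
    return a, b, c, d


def conditions(a, b, c, d, S2):
    """Left-hand sides of equations (16), (17) and (18)."""
    norm_alpha = a * a + b * b + 2.0 * a * b * S2      # should equal 1
    norm_beta = c * c + d * d + 2.0 * c * d * S2       # should equal 1
    overlap = a * c + b * d + (a * d + b * c) * S2     # should equal 0
    return norm_alpha, norm_beta, overlap


for S2 in (0.0, 0.347, 0.9):
    n_alpha, n_beta, ov = conditions(*hl_coefficients(S2), S2)
    print(S2, n_alpha, n_beta, ov)
```

For any value of $S^2$ between 0 and 1 the first two numbers printed are
unity and the third is zero, to within rounding.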

From symmetry considerations it is also evident that

$$\int u_A(1)\,u_B(1)\, dv_1 = \int u_A(2)\,u_B(2)\, dv_2 = S. \qquad (24)$$

Furthermore, these four eigenfunctions satisfy the four S. equations:



and 
Hence, 

and 



V 2 M B (2) 
V 2 B (1) 

V 2 A (2) 



Ff )u B (2) - 
Ff )tfc(l) = 







(25) 



= A (1)V 2 B (2) 
= -K(2E + 7f 



(26) 
(27) 

(28) 

Now if $\phi_\alpha$ is a solution of the S. equation (11a), then, by substituting
from equations (7) and (22), we obtain the equation

Ft TT A i TrB 
+ 



0. 




7? 



(29) 



From (29) and (28) it follows that 




(30) 

This is an inhomogeneous partial differential equation of the same 
type as that encountered in equation (9.11) in connection with the cal- 
culation of the first-order perturbation energy term. In order that it 
may have a solution, it is necessary that the right-hand side of
equation (30) should be orthogonal with respect to the solution $\phi_1 + \phi_2$ of
the corresponding homogeneous differential equation.

Hence, 



(F - *) (2 



(31) 
That is,

$$\eta_\alpha = \frac{E_{11} + E_{12}}{1 + S^2}, \qquad (32)$$

where



2E U 



!/12 



/(2F -V$-V?-V$- 



(33) 
(34) 



and $S^2$ is defined by equations (19) and (24).

In a similar manner it may be deduced that the first-order perturbation
energy term corresponding to $\phi_\beta^0$ is given by the relation 2

$$\eta_\beta = \frac{E_{11} - E_{12}}{1 - S^2}. \qquad (35)$$




12.2 Physical Interpretation. The terms $\eta_\alpha$ and $\eta_\beta$ represent, to a
first-order approximation, two different values for the interaction energy
of two hydrogen atoms, and an inspection of equations (22) and (23) shows
that $\eta_\alpha$ corresponds to the symmetrical zero-order eigenfunction $\phi_\alpha^0$,
while $\eta_\beta$ corresponds to the antisymmetrical function $\phi_\beta^0$. Thus the
quantum mechanics treatment of the problem leads to the conclusion that
two hydrogen atoms can interact in two different modes. The fundamental
reason for this is the fact that it is possible for the two electrons to
interchange places; or, stated in more technical language, the existence
of two modes of vibrations and corresponding eigenvalues is due to the
degeneracy of the system in the unperturbed state. This, in turn, occurs
because the two electrons are absolutely equivalent, so that it is
impossible to distinguish between them. In other words, the whole argument
is a logical deduction from the Principle of Indeterminacy in the sense
that, when two hydrogen atoms approach each other very closely, there is no
experimental method by which each electron may be "tagged" and observed.

FIG. 55. Plot (abscissa $R/a_0$) for the interaction of two hydrogen atoms;
curve S corresponds to the symmetric, and curve A to the antisymmetric mode.

Evidently the interaction energy terms $\eta_\alpha$ and $\eta_\beta$ are functions of R.
While the actual details of evaluating the expressions for $E_{11}$ and $E_{12}$
are discussed in a subsequent section, Fig. 55 shows the results deduced
by Y. Sugiura, 3 which are more accurate than those obtained by Heitler
and London in their investigation. The values of $\eta_\alpha$ (curve S) and
$\eta_\beta$ (curve A) are given in electron-volts (v.e.) 4 as functions of the
internuclear distance $R/a_0$, where $a_0$ denotes the Bohr unit radius.

It will be observed that $\eta_\alpha$ reaches a minimum value of -3.2 v.e. for
$R = 1.52a_0 = 0.80$ A, while $\eta_\beta$ is positive. Thus, $\eta_\alpha$ represents
the lower energy state and must correspond to molecule formation, while
$\eta_\beta$, since it is positive, must correspond to repulsion between the
atoms. 5 Consequently, the symmetric eigenfunction $\phi_\alpha$ represents a
stable state, while the antisymmetric eigenfunction $\phi_\beta$ represents an
unstable state.

FIG. 56. Plots of the total energy, Coulomb energy and exchange energy as
functions of internuclear distance, for the two modes of interaction of two
hydrogen atoms.

Figure 56 shows plots of the same quantities in terms of calories per
mole H2. For comparison there are also plotted the energy term
$E_{11}$ and the curve calculated by P. M. Morse 6 from observations on the
band spectrum of H2. These observations lead to a minimum of the
curve at $R = 1.40a_0 = 0.74$ A. The considerations upon which this
calculation has been made will be discussed in the following chapter.
Calculation shows that the term $E_{12}$ is negative (over a large range of
values of $R/a_0$) and greater in absolute value than $E_{11}$, while $S^2$ is
positive and less than 1 for all values of $R/a_0 > 1$. From the curves in
Fig. 56 it is seen that for values of $R/a_0 \geq 1.5$, approximately, the
relations between $\eta_\alpha$, $\eta_\beta$, $E_{11}$, and $E_{12}$ are those indicated
diagrammatically in Fig. 57.

2 The notation $E_{11}$ and $E_{12}$ is that used by Heitler and London. In a
subsequent section these will be shown to correspond to matrix elements
which occur in a second-degree secular equation.

3 Sugiura, Z. Physik, 45, 484 (1927).

4 1 v.e. = 23,055 cal./mole.

5 Thus, curve S is similar to the plots shown in Fig. 51, Chapter VIII, for the
energy of interaction of two molecules.

6 Morse, Phys. Rev., 34, 67 (1929); Condon and Morse, "Quantum Mechanics,"
p. 163; also see the following chapter for further discussion.

A consideration of equations (33) and (34) throws considerable light
on the nature of the attractive energy forces which lead to molecule
formation. In equation (33), the term $V(\phi_1^2 + \phi_2^2)$ represents
physically the total repulsive energy due to the nuclear charges and the
electronic charge distributions. Similarly, the other terms on the
right-hand side of (33) represent the attractive energy between the nuclei
and the electronic charge distributions. Hence, we may designate $E_{11}$ as
the Coulomb interaction energy. On the basis of classical considerations
this should constitute the whole of the interaction energy between two
hydrogen atoms. Actual evaluation of the expression for $E_{11}$ shows
that this has a minimum value of -0.488 v.e. for $R = 1.90a_0 = 1.00$ A.
Since the value of $S^2$ for this value of R is 0.347, it follows that
$-E_{11}/(1 + S^2) = 0.362$ v.e. = 8350 cal./mole. Comparing this result with
the observed energy of formation of H2, which is 4.72 v.e. = 108,900
cal./mole, it is seen that the classical electrostatic attraction and
repulsion energies are quite inadequate to account for the energy of the
valence bond.

FIG. 57. Relation between the terms in the expressions for the energy of
interaction of two hydrogen atoms.
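
The arithmetic of this comparison is easily checked. The short sketch
below uses only the figures quoted in the text ($E_{11}$ = -0.488 v.e.,
$S^2$ = 0.347, an observed energy of formation of 4.72 v.e., and the
conversion factor 1 v.e. = 23,055 cal./mole); the variable and function
names are illustrative.

```python
CAL_PER_MOLE_PER_VE = 23055.0   # conversion factor quoted in the text


def coulomb_share(E11_ve, S2):
    """Coulomb part of the symmetric-state energy, E11 / (1 + S^2), in v.e."""
    return E11_ve / (1.0 + S2)


coulomb_ve = coulomb_share(-0.488, 0.347)            # about -0.362 v.e.
coulomb_cal = coulomb_ve * CAL_PER_MOLE_PER_VE       # about -8350 cal./mole
observed_cal = 4.72 * CAL_PER_MOLE_PER_VE            # observed energy of formation
fraction = -coulomb_cal / observed_cal               # share of the bond energy

print(round(coulomb_ve, 3), round(coulomb_cal), round(fraction, 3))
```

On these figures the Coulomb term supplies somewhat less than 8 per cent
of the observed energy of formation.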

As mentioned already, the energy of formation as calculated by
Sugiura is 3.2 v.e. Though this is less than the observed value,
nevertheless it indicates that the valence energy is accounted for to a
considerable extent by the term $E_{12}$. What is the physical significance
of this energy term? A similar expression was encountered in the
exchange-energy term $V_{12}$ which was derived in the solution of the helium
atom problem. In the present case we also designate $E_{12}$ as the exchange
energy, and it is seen that the presence of this term is due to the
assumption that the eigenfunction for the system has the form given in
equations (5) or (6). In other words, $E_{12}$ occurs because of the
possibility of interchange of the electrons, and we may regard
$\nu_{12} = E_{12}/h$ as a measure of the frequency of this interchange.



A great deal has been written about the non-classical nature of the term
$E_{12}$, and since this term accounts, as shown above, for a large part of the
energy of formation of H2, a distinction has been drawn between the types of
forces involved in the two energy terms ($E_{11}$ and $E_{12}$). Evidently, such a
distinction is only the result of the mathematical computation, for, as a
matter of fact, the quantum-mechanical treatment recognizes that the only
forces involved in the binding of two hydrogen atoms are those which arise
from electrostatic attraction and repulsion between the four particles which
constitute the system. The exchange term is merely an expression of the
physical requirement that the electrons in H2 cannot be regarded as
localized about the nuclei with which they were associated in the separated
atoms. 7

There is this important difference between the energy terms $V_{12}$
and $E_{12}$, that, whereas $E_{12}$ is negative in the case of the hydrogen
molecule (and in most other cases of interaction of similar atoms), the
term $V_{12}$ is positive, as is readily evident from equation (10.2$) and the
subsequent calculations in Chapter X. This is due to the fact that
$V_{12}$ represents a repulsive energy term which arises from the
interchange of coordinates by two electrons of different quantum numbers
in the same atom. On the other hand, $E_{12}$ represents an attractive
energy which occurs because of the possibility of interchange of two
equivalent electrons, one from each of the atoms.

FIG. 58. Electron distribution for elastic reflection of two hydrogen atoms.

On the basis of these considerations Heitler and London concluded that,
to a first approximation, the energy $E_{12}$ corresponds to the energy of
the valence bond in the molecule H2. This is most readily evident from the
plots, calculated from $(\phi_\alpha^0)^2$ and $(\phi_\beta^0)^2$, of the density of
charge distribution for the symmetric and antisymmetric cases,
respectively. Figures 58 and 59, taken from a paper by F. London, 8
show the results obtained.

The density is constant on each curve (so that they correspond to
the isobars in atmospheric pressure measurements). The numbers
attached to each curve give the relative densities or probabilities of
occurrence of the electrons. Figure 58 illustrates elastic reflection and
shows that the electrons tend to occur during most of the time in the
regions removed from the center. They avoid the region between the
nuclei. On the other hand, Fig. 59, which represents the distribution
for molecule formation (homopolar combination), shows that in this case
the electrons tend to occur during most of the time in the region between
the nuclei. Thus, the methods of quantum mechanics lead to an
interpretation of the shared electron pair or non-polar bond of the
Lewis-Langmuir theory of valency. When the chemists write the electronic
formula for H2 as H:H, they are therefore in logical agreement with the
conclusion deduced from the solution of the S. equation. In other
words, the theory of Heitler and London gives a quantitative basis for a
representation which the chemist had derived by intuition.

FIG. 59. Electron distribution for hydrogen molecule formation.

7 S. Dushman and F. Seitz, J. Phys. Chem., 41, 233 (1937).

8 F. London, Leipziger Vorträge, 1928, pp. 59-84.

12.3 The Application of the Pauli Exclusion Principle. Here, as in 
the case of the helium atom, we must take the electron spins into 
consideration. Here also we find that it is possible to obtain only four 
functions which are completely antisymmetrical in the electrons and 
which, therefore, satisfy the Pauli Exclusion Principle. 

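Let us designate the two spin functions $\psi(\tfrac{1}{2})$ and $\psi(-\tfrac{1}{2})$ by $\alpha$ and
$\beta$, respectively. Then the only completely antisymmetrical functions
which can be formed from the spatial functions $\phi_\alpha^0$ (symmetric) and $\phi_\beta^0$
(antisymmetric) are the following:

$$\phi_\alpha^0\,\tfrac{1}{\sqrt{2}}\{\alpha(1)\beta(2) - \beta(1)\alpha(2)\}, \qquad (\Sigma s = 0)$$

$$\phi_\beta^0\,\alpha(1)\alpha(2), \qquad (\Sigma s = +1)$$

$$\phi_\beta^0\,\tfrac{1}{\sqrt{2}}\{\alpha(1)\beta(2) + \beta(1)\alpha(2)\}, \qquad (\Sigma s = 0)$$

and

$$\phi_\beta^0\,\beta(1)\beta(2). \qquad (\Sigma s = -1)$$

Only the first of these four functions involves $\phi_\alpha^0$, and in that case,
$\Sigma s = 0$; that is, the spins are antiparallel. In the other three
functions the total spin has the values +1, 0, and -1.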

Thus we find that there exist three possible states in which the 
atoms repel each other and one state in which they attract and form a 
molecule. That is, when two hydrogen atoms collide there is a 25 
per cent probability that this collision will result in the formation of a 
molecule. Furthermore, in the molecule, the spins of the two electrons 
must be antiparallel. For this reason the normal state of the molecule
is designated spectroscopically as a singlet state ($^1\Sigma_g$), whereas the
repulsive state is of the triplet type ($^3\Sigma_u$).
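
The counting argument (one attractive state against three repulsive ones)
can be illustrated by a small enumeration. In the sketch below, which is
only illustrative, a two-electron spin function is stored as a table of
amplitudes over the four products $\alpha(1)\alpha(2)$, $\alpha(1)\beta(2)$,
$\beta(1)\alpha(2)$, $\beta(1)\beta(2)$, written "aa", "ab", "ba", "bb"; the
normalized singlet and triplet combinations are the standard ones, and all
names are assumptions made for the example.

```python
import itertools
import math

# Basis products: (spin of electron 1, spin of electron 2), "a" = alpha, "b" = beta.
BASIS = list(itertools.product("ab", repeat=2))


def exchange(state):
    """Interchange electrons 1 and 2 in a spin function."""
    return {(s2, s1): amp for (s1, s2), amp in state.items()}


def symmetry(state):
    """Return +1 if symmetric, -1 if antisymmetric, 0 otherwise."""
    swapped = exchange(state)
    if all(abs(swapped.get(k, 0.0) - state.get(k, 0.0)) < 1e-12 for k in BASIS):
        return 1
    if all(abs(swapped.get(k, 0.0) + state.get(k, 0.0)) < 1e-12 for k in BASIS):
        return -1
    return 0


r = 1.0 / math.sqrt(2.0)
singlet = {("a", "b"): r, ("b", "a"): -r}
triplet = [
    {("a", "a"): 1.0},
    {("a", "b"): r, ("b", "a"): r},
    {("b", "b"): 1.0},
]

# The complete function must be antisymmetric, so a symmetric spatial function
# (+1, attractive) pairs with an antisymmetric spin function and vice versa.
spin_states = [singlet] + triplet
attractive = [s for s in spin_states if symmetry(s) == -1]
repulsive = [s for s in spin_states if symmetry(s) == 1]
probability_of_bond = len(attractive) / len(spin_states)
```

With equal weight given to each of the four spin functions,
`probability_of_bond` is 0.25, in agreement with the 25 per cent quoted
above.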
The suffixes g and u signify respectively "gerade," that is, even, and
"ungerade," that is, odd. These designations refer to the fact that



while 



Although the energy state corresponding to $\phi_\beta$ is unstable, "it is used
to explain a certain continuous spectrum emitted by H2, due to
transitions from an excited triplet state, not shown in Fig. 55, down to the
$^3\Sigma$ curve. Transitions down to this curve would obviously lead to a
dissociation of the molecule, giving rise to a continuous spectrum." 9

12.4 Calculation of Perturbation Energy Terms. Let us consider
first the integral

$$S = \int u_A(1)\,u_B(1)\, dv_1 \qquad (24)$$

$$\;\; = \frac{1}{\pi a_0^3} \int e^{-(r_{A1} + r_{B1})/a_0}\, dv_1. \qquad (36a)$$

We shall find it advantageous to express all distances in atomic
units, and designate these by $\rho$, with a corresponding subscript. Thus
$\rho_{A1} = r_{A1}/a_0$, and so forth. In terms of these units,

$$S = \frac{1}{\pi} \int e^{-(\rho_{A1} + \rho_{B1})}\, dv_1. \qquad (36b)$$

Now, in evaluating integrals such as those for S, $E_{11}$, $E_{12}$, and others
which occur in two-center problems, it is found convenient to utilize
spheroidal coordinates of the particular type known as confocal elliptic.
These are defined as follows:

$$\lambda = \frac{1}{D}(\rho_{A1} + \rho_{B1}), \qquad (37a)$$

and

$$\mu = \frac{1}{D}(\rho_{A1} - \rho_{B1}), \qquad (37b)$$

where $D = R/a_0$. 10 As third coordinate, we use the angle $\varphi$, which a
plane passing through the two nuclei and the instantaneous position of
the electron makes with a fixed plane through the two nuclei. It will be
recognized that $\lambda$ defines an ellipse described about the two nuclei as
focal points and passing through the point designating the position of
electron 1. Similarly $\mu$ defines a hyperbola which passes through the
same point. By revolving such a system of confocal ellipses and
hyperbolas about the line joining the nuclei as axis, there is obtained a
set of confocal ellipsoids and hyperboloids, which (see Fig. 61 in
supplementary note 1) intersect along circles. These circles lie in planes
perpendicular to the axis and have their centers on this axis. Hence,
in order to specify the position of the electron, we require not only $\lambda$
and $\mu$ (which define a particular circle) but also $\varphi$, the angle which a
line passing through the center and the point makes with a fixed axis.
As shown in supplementary note 1, the element of volume is given by

$$d\tau = \frac{D^3}{8}(\lambda^2 - \mu^2)\, d\lambda\, d\mu\, d\varphi. \qquad (38)$$

9 J. H. Van Vleck and A. Sherman, "The Quantum Theory of Valence," Rev.
Modern Phys., 7, 167 (1935). This is also discussed in the following chapter.

10 The significance of $\lambda$ and $\mu$ in these equations is, of course, not to be
confused with the interpretations used in other connections.
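
As an illustration of the definitions (37a) and (37b), the following
sketch converts a Cartesian position into confocal elliptic coordinates.
It rests on assumptions not made in the text: the nuclei A and B are
placed on the z axis at $z = -D/2$ and $z = +D/2$, all lengths are in
atomic units, and the function names are hypothetical.

```python
import math


def elliptic_coordinates(x, y, z, D):
    """Return (lam, mu, phi) for a point, with nuclei at z = -D/2 (A) and z = +D/2 (B)."""
    rho_A = math.sqrt(x * x + y * y + (z + D / 2.0) ** 2)
    rho_B = math.sqrt(x * x + y * y + (z - D / 2.0) ** 2)
    lam = (rho_A + rho_B) / D                 # equation (37a)
    mu = (rho_A - rho_B) / D                  # equation (37b)
    phi = math.atan2(y, x) % (2.0 * math.pi)  # angle about the internuclear axis
    return lam, mu, phi


def distances(lam, mu, D):
    """Recover rho_A and rho_B from lam and mu."""
    return D * (lam + mu) / 2.0, D * (lam - mu) / 2.0


lam, mu, phi = elliptic_coordinates(0.4, -0.3, 0.9, 1.4)
rho_A, rho_B = distances(lam, mu, 1.4)
```

For any position, `lam` is not less than 1 and `mu` lies between -1 and
+1, which are the limits of integration used in what follows; at the
midpoint between the nuclei `lam` is 1 and `mu` is 0.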

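Hence, in terms of these coordinate variables,

$$S = \frac{D^3}{8\pi} \iiint e^{-D\lambda}(\lambda^2 - \mu^2)\, d\lambda\, d\mu\, d\varphi
   = \frac{D^3}{4} \iint e^{-D\lambda}(\lambda^2 - \mu^2)\, d\lambda\, d\mu,$$

since the limits of $\varphi$ are 0 and $2\pi$, and we can therefore integrate directly
with respect to this variable.

The limits for the other variables are defined by $1 < \lambda < \infty$;
$-1 < \mu < 1$. Expressing the integral as the product of one integral with
respect to $\lambda$ and of the other with respect to $\mu$,

$$S = \frac{D^3}{4}\left\{\int_1^\infty \lambda^2 e^{-D\lambda}\, d\lambda \int_{-1}^{1} d\mu
   - \int_1^\infty e^{-D\lambda}\, d\lambda \int_{-1}^{1} \mu^2\, d\mu\right\}
   = \frac{1}{2}\int_D^\infty x^2 e^{-x}\, dx - \frac{D^2}{6}\int_D^\infty e^{-x}\, dx,$$

where $x = D\lambda$. The expressions for these integrals are given in
Appendix III. Consequently, we obtain the result

$$S = e^{-D}\left(1 + D + \frac{D^2}{3}\right). \qquad (39)$$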
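
The closed form for S may be compared with a direct numerical evaluation
of the double integral over $\lambda$ and $\mu$. The sketch below assumes the
result $S = e^{-D}(1 + D + D^2/3)$ and uses a simple midpoint rule, with
the infinite range of $\lambda$ cut off where the exponential factor is
negligible; the grid sizes and function names are arbitrary choices made
for the illustration.

```python
import math


def overlap_closed_form(D):
    """S = exp(-D) (1 + D + D^2/3), with D = R/a0."""
    return math.exp(-D) * (1.0 + D + D * D / 3.0)


def overlap_numeric(D, n_lam=2000, n_mu=100):
    """Midpoint-rule value of (D^3/4) * double integral of exp(-D lam)(lam^2 - mu^2)."""
    lam_max = 1.0 + 40.0 / D          # exp(-D * lam) is negligible beyond this point
    h_lam = (lam_max - 1.0) / n_lam
    h_mu = 2.0 / n_mu
    total = 0.0
    for i in range(n_lam):
        lam = 1.0 + (i + 0.5) * h_lam
        inner = 0.0
        for j in range(n_mu):
            mu = -1.0 + (j + 0.5) * h_mu
            inner += lam * lam - mu * mu
        total += math.exp(-D * lam) * inner * h_mu
    return D ** 3 / 4.0 * total * h_lam


for D in (0.5, 1.5, 3.0):
    print(D, overlap_closed_form(D), overlap_numeric(D))
```

The two columns printed agree to within the accuracy of the quadrature,
and `overlap_closed_form(0.0)` gives unity, as required for coalescing
nuclei.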
Figure 60 shows a plot of $S^2$ as a function of D. (In the figure this
ratio is designated $\rho$.)

We shall now consider the relation for $E_{11}$ as given in equation (33).
The first term on the right-hand side is




and since the two electrons are equivalent it follows that we can write 



where 



FIG. 60. Plot of the function $S^2$ as a function of internuclear distance.

This integral represents the repulsive energy between the two electronic
charge distributions represented by $u_A^2(1)$ and $u_B^2(2)$, that is, by the
two functions

$$u_A^2(1) = \frac{1}{\pi a_0^3}\, e^{-2r_{A1}/a_0}$$

and

$$u_B^2(2) = \frac{1}{\pi a_0^3}\, e^{-2r_{B2}/a_0}.$$

Now according to equation (10.18), the potential at the point
$\rho = \rho_{A1} = \rho_A$, due to the electron distribution about the point B, is
given by

$$\frac{e}{a_0 \rho_B}\{1 - e^{-2\rho_B}(1 + \rho_B)\}, \qquad (40)$$

where $\rho_B$ represents $\rho_{B1}$.

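Hence the interaction energy of the charge distribution about the
point B with a similar distribution about the point A is given by

$$J_1 = \frac{e^2}{\pi a_0} \int \frac{e^{-2\rho_A}}{\rho_B}\{1 - e^{-2\rho_B}(1 + \rho_B)\}\, dv_1.$$

Introducing the confocal elliptic coordinates defined by equations (37a)
and (37b), and integrating with respect to $\varphi$, the last equation assumes
the form

$$J_1 = \frac{e^2 D^2}{2a_0} \int_1^\infty\!\!\int_{-1}^{1} (\lambda + \mu)\, e^{-D(\lambda + \mu)}
   \left\{1 - e^{-D(\lambda - \mu)}\left[1 + \tfrac{D}{2}(\lambda - \mu)\right]\right\} d\mu\, d\lambda$$

$$\;\; = \frac{e^2}{a_0}\left\{\frac{1}{D} - e^{-2D}\left(\frac{1}{D} + \frac{11}{8} + \frac{3D}{4} + \frac{D^2}{6}\right)\right\}. \qquad (41)$$
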

In the case of the four negative terms on the right-hand side of
equation (33), which involve respectively the four reciprocal distances
$\rho_{A1}^{-1}$, $\rho_{A2}^{-1}$, $\rho_{B2}^{-1}$, and $\rho_{B1}^{-1}$, it follows from the
equivalence of the electrons that each of these terms represents the
attractive energy due to interaction of an electron charge distribution
about one of the nuclei, say A, with the positive charge at the other
nucleus B. Denoting each of these four terms by $J_2$, it follows from
equation (40) and the relation $R = a_0 D$ that

$$J_2 = \frac{e^2}{a_0 D}\{1 - e^{-2D}(1 + D)\}. \qquad (42)$$

Consequently,

$$E_{11} = \frac{e^2}{R} + J_1 - 2J_2 \qquad (43)$$

$$\;\; = \frac{e^2}{a_0 D}\, e^{-2D}\left\{1 + \frac{5D}{8} - \frac{3D^2}{4} - \frac{D^3}{6}\right\}. \qquad (44)$$


From the plot in Fig. 56 of $E_{11}$ against D it will be observed that for
D > 1.4 approximately, the expression has a negative value. This
signifies that, for distances beyond D = 1.4, the Coulomb attractive energy
between the nuclei and the negative charge distributions exceeds the
total repulsive energy which exists between the two nuclei and also
between the two electronic charge distributions. The Coulomb forces
of attraction and of repulsion are equal at the distance D = 1.9,
approximately, for which $dE_{11}/dD = 0$.
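
The two distances just mentioned can be located numerically. The sketch
below assumes that the Coulomb energy has the closed form
$E_{11} = (e^2/a_0)\,e^{-2D}(1/D + 5/8 - 3D/4 - D^2/6)$, which is the standard
result for two hydrogen 1s distributions and is taken here to be what
equation (44) expresses; energies are in units of $e^2/a_0$. The bisection
routine and all names are illustrative.

```python
import math


def coulomb_energy(D):
    """E11 in units of e^2/a0 (closed form assumed for equation (44))."""
    return math.exp(-2.0 * D) * (1.0 / D + 5.0 / 8.0 - 3.0 * D / 4.0 - D * D / 6.0)


def slope(D, h=1e-6):
    """Central-difference estimate of dE11/dD."""
    return (coulomb_energy(D + h) - coulomb_energy(D - h)) / (2.0 * h)


def bisect(f, lo, hi, tol=1e-10):
    """Locate a sign change of f between lo and hi by repeated halving."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_lo < 0.0) == (f_mid < 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


D_zero = bisect(coulomb_energy, 1.0, 1.8)   # where E11 changes sign
D_min = bisect(slope, 1.5, 2.5)             # where dE11/dD = 0

print(round(D_zero, 2), round(D_min, 2))
```

On this assumption the change of sign falls close to D = 1.4 and the
minimum close to D = 1.9, in accordance with the statements above.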

We now have to evaluate the different integrals which occur in
equation (34) for the energy $E_{12}$. Because of the equivalence of the
electrons, it follows that the four negative terms must each be equal to
the same integral. Let us consider the integral



f F 
J 



where 

*2 /. tf -(p A1 +P B i) 

vi. (45a) 



r 

I 

J 



PAI 



Using the transformation to confocal elliptic coordinates defined by 
equations (37) and (38), it follows that 



GO 2 i -\ 



(456) 
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and therefore, 

(46c) 



3 3 

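The first integral on the right-hand side of equation (34) is

$$e^2\int\!\!\int\left(\frac{1}{R} + \frac{1}{r_{12}}\right)\phi_1\phi_2\,dv_1\,dv_2 = \frac{e^2}{R}S^2 + K_1,$$

where

$$K_1 = \frac{e^2}{a_0}\int\!\!\int \phi_1\phi_2\left(\frac{1}{\rho_{12}}\right)dv_1\,dv_2. \qquad (46)$$

Consequently, we obtain the result

$$E_{12} = \frac{e^2 S^2}{R} + K_1 - 2SK_2. \qquad (47)$$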

Heitler and London did not evaluate the integral in equation (46), 
but concluded that 



However, Y. Sugiura 11 showed, as a result of a lengthy calculation,
that the integral could be represented by the relation

$$K_1 = \frac{e^2}{5a_0}\left[-e^{-2D}\left(-\frac{25}{8} + \frac{23D}{4} + 3D^2 + \frac{D^3}{3}\right) + \frac{6}{D}\left\{S^2(C + \ln D) + S_1^2\,Ei(-4D) - 2SS_1\,Ei(-2D)\right\}\right], \qquad (48)$$

where C = Euler's constant = 0.5772,

$S = e^{-D}(1 + D + D^2/3)$, $S_1 = e^{D}(1 - D + D^2/3)$,

ln = natural logarithm,

$Ei(-x)$ = integral logarithm 12 $= \int_{\infty}^{x} \frac{e^{-u}}{u}\,du$.

11 Sugiura, Z. Physik, 45, 484 (1927).

12 Values of this function are given in "Funktionentafeln," by E. Jahnke and
F. Emde, B. G. Teubner, Berlin, 1928, pp. 19-22. Also see Appendix III.
As a result of his calculation, Sugiura derived a minimum value for the
interaction energy of -3.2 v.e. for D = 1.4 (R = 0.80 Å), which is
about 0.8 v.e. lower than that obtained by HL, but still above the
observed value -4.72 v.e. 13
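The way in which the Coulomb and exchange terms combine into a binding-energy curve may be illustrated numerically. The sketch below assumes the closed forms of $E_{11}$, $K_2$, and the overlap $S = e^{-D}(1 + D + D^2/3)$, together with Sugiura's expression for $K_1$ in the form usually quoted in the literature, and the combination $(E_{11} + E_{12})/(1 + S^2)$ for the symmetric state. The series used for $Ei(-x)$, the conversion factor from $e^2/a_0$ to electron volts, the grid, and all function names are assumptions made for illustration. A modern evaluation of this kind need not reproduce exactly the position of the minimum reported in the early papers.

```python
import math

EULER_GAMMA = 0.5772156649015329
HARTREE_IN_EV = 27.2114  # assumed conversion: e^2/a_0 expressed in electron volts


def ei_neg(x, terms=200):
    """Ei(-x) for x > 0 from the series gamma + ln x + sum (-x)^k / (k * k!)."""
    total = EULER_GAMMA + math.log(x)
    term = 1.0
    for k in range(1, terms + 1):
        term *= -x / k          # term is now (-x)^k / k!
        total += term / k
    return total


def overlap(d):
    """One-electron overlap integral S between the two 1s orbitals."""
    return math.exp(-d) * (1.0 + d + d * d / 3.0)


def coulomb_e11(d):
    """E_11 in units of e^2/a_0."""
    return math.exp(-2.0 * d) / d * (1.0 + 5.0 * d / 8.0 - 0.75 * d * d - d**3 / 6.0)


def k2(d):
    """K_2 in units of e^2/a_0."""
    return math.exp(-d) * (1.0 + d)


def k1(d):
    """K_1 in units of e^2/a_0 (Sugiura's closed form, as assumed above)."""
    s = overlap(d)
    s1 = math.exp(d) * (1.0 - d + d * d / 3.0)
    poly = -math.exp(-2.0 * d) * (-25.0 / 8.0 + 23.0 * d / 4.0 + 3.0 * d * d + d**3 / 3.0)
    logs = s * s * (EULER_GAMMA + math.log(d)) + s1 * s1 * ei_neg(4.0 * d) - 2.0 * s * s1 * ei_neg(2.0 * d)
    return (poly + 6.0 / d * logs) / 5.0


def exchange_e12(d):
    """E_12 = S^2/D + K_1 - 2 S K_2, in units of e^2/a_0."""
    s = overlap(d)
    return s * s / d + k1(d) - 2.0 * s * k2(d)


def e_symmetric(d):
    """Interaction energy of the symmetric state, in units of e^2/a_0."""
    return (coulomb_e11(d) + exchange_e12(d)) / (1.0 + overlap(d) ** 2)


# Scan D = R/a_0 on a coarse grid and report the lowest point of the curve.
grid = [1.0 + 0.01 * i for i in range(201)]
d_min = min(grid, key=e_symmetric)
e_min_ev = e_symmetric(d_min) * HARTREE_IN_EV
print(f"minimum near D = {d_min:.2f}, interaction energy = {e_min_ev:.2f} v.e.")
```

The computed depth of the minimum should be close to the 3.2 v.e. attributed to Sugiura and well short of the observed 4.72 v.e.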

12.5 The Variational Method. The HL method yields only an ap- 
proximate value for the energy of formation of H2 from the atoms; 
more accurate results have been obtained in the solution of this problem, 
as in that of the helium atom, by the application of the variational 
method. As a first illustration of this, it is interesting to observe how 
much more readily the eigenvalues may be derived in this manner as 
compared with the rather tedious calculation by HL in which the 
perturbation method was used. 

As shown in the previous chapter, we can write the condition for the
solution of the S. equation in the form

$$E = \frac{\int \phi H \phi\, d\tau}{\int \phi^2\, d\tau} = \text{minimum}, \qquad (49)$$

where H is defined by equation (12), and the value of E will be lower
(more negative), the more nearly the function $\phi$ approaches the correct
eigenfunction for the given state.

As a first approximation let us assume that the zero-order function
is represented by the Heitler-London function $\phi$ defined by the relation

$$\phi = a\phi_1 + b\phi_2, \qquad (50)$$

where $\phi_1 = u_A(1)u_B(2)$, $\phi_2 = u_B(1)u_A(2)$, and $u_A(1)$, etc., are
defined by equations (1) and (2), while a and b
are parameters to be determined from the conditions

$$\frac{\partial E}{\partial a} = \frac{\partial E}{\partial b} = 0.$$

Since $\phi_1$ and $\phi_2$ are normalized (though not mutually orthogonal)
functions, it follows that equation (49) assumes the form

$$(a^2 + b^2 + 2abS^2)E = a^2H_{11} + ab(H_{12} + H_{21}) + b^2H_{22}, \qquad (51)$$

13 The energy of formation of H2 from the atoms is, of course, +4.72 v.e. This
value is greater than the observed energy of dissociation of H2 because the latter does
not include the "zero point" energy of vibration of the nuclei, as will be discussed
more fully in the following chapter.
where $S^2$ is defined by equation (19), and

$$H_{11} = \int \phi_1 H \phi_1\, d\tau, \quad H_{12} = \int \phi_1 H \phi_2\, d\tau, \quad H_{21} = \int \phi_2 H \phi_1\, d\tau, \quad H_{22} = \int \phi_2 H \phi_2\, d\tau. \qquad (52)$$

Because of the equivalence of the two electrons, it is evident that

$$H_{12} = H_{21} \quad \text{and} \quad H_{11} = H_{22}.$$

Hence, equation (51) becomes

$$(a^2 + b^2 + 2abS^2)E = (a^2 + b^2)H_{11} + 2abH_{12}.$$

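Differentiating with respect to a and b separately, we derive two
relations which must be satisfied. These are

$$(a^2 + b^2 + 2abS^2)\frac{\partial E}{\partial a} = 2a(H_{11} - E) + 2b(H_{12} - S^2E) = 0,$$

and

$$(a^2 + b^2 + 2abS^2)\frac{\partial E}{\partial b} = 2a(H_{12} - S^2E) + 2b(H_{11} - E) = 0.$$

Consequently, we derive the determinantal relation, that is, the
secular equation,

$$\begin{vmatrix} H_{11} - E & H_{12} - S^2E \\ H_{12} - S^2E & H_{11} - E \end{vmatrix} = 0. \qquad (53)$$

The roots of this equation are, evidently,

$$E_S = \frac{H_{11} + H_{12}}{1 + S^2} \qquad (54)$$

and

$$E_A = \frac{H_{11} - H_{12}}{1 - S^2}. \qquad (55)$$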
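From equations (12) and (26), it follows that

$$H_{11} = \int \phi_1\left\{2E_0 + e^2\left(\frac{1}{r_{12}} + \frac{1}{R} - \frac{1}{r_{A2}} - \frac{1}{r_{B1}}\right)\right\}\phi_1\, d\tau = 2E_0 + \frac{e^2}{R} + J_1 - 2J_2, \qquad (56)$$

where $J_1$ and $J_2$ are expressions defined by equations (41) and (42),
respectively. Comparing with equation (43), it is seen that

$$H_{11} = 2E_0 + E_{11}. \qquad (57)$$

In the same manner it is shown that

$$H_{12} = 2E_0S^2 + \frac{e^2S^2}{R} + K_1 - e^2\int\!\!\int\left(\frac{1}{r_{A1}} + \frac{1}{r_{B2}}\right)\phi_1\phi_2\, d\tau = 2E_0S^2 + \frac{e^2S^2}{R} + K_1 - 2SK_2, \qquad (58)$$

where $K_2$ and $K_1$ are the expressions defined by equations (45a) and (46),
respectively. Comparing with equation (47), it follows that

$$H_{12} = 2E_0S^2 + E_{12}. \qquad (59)$$

Hence,

$$E_S - 2E_0 = \frac{E_{11} + E_{12}}{1 + S^2} \quad \text{[see equation (32)]} \qquad (60)$$

while

$$E_A - 2E_0 = \frac{E_{11} - E_{12}}{1 - S^2} \quad \text{[see equation (35)]} \qquad (61)$$

These results are identical with those derived in the previous section.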

12.6 Method of Molecular Orbitals. The fact that the Heitler-London
method leads to a binding energy of only 3.2 v.e. as compared
with the observed value of 4.72 v.e. shows that the simple assumption
made by Heitler and London for the form of $\phi_0$ requires considerable
modification.

In seeking for an explanation of the discrepancy between calculated 
and observed values, it is evident that one cause is the neglect of the 
mutual polarization of the atoms. As mentioned in the following 
section, N. Rosen has shown that, when this factor is taken into account, 
the value deduced for the binding energy is 4.02 v.e. 

Further consideration shows that in the Heitler-London method two
very important effects have been neglected. 14 In the first place no
allowance has been made for the probability of the simultaneous occurrence
of both electrons near the same nucleus. This would be indicated
by the presence in the eigenfunction of ionic terms. "When
they are included," as Penney remarks, "they must be introduced
symmetrically into the wave function in order that there is just as
much probability of finding both electrons on the one proton as the
other. If this were not done, the assumed wave functions would
attribute a permanent dipole moment to the hydrogen molecule."

In the second place the Heitler-London method fails to take into
account to a sufficient extent the fact that the two electrons will tend
to avoid each other, so that the probability of the occurrence of one
electron at any point in the configuration space must be a function of
$r_{12}$. The additional energy of binding which results from this effect
is the correlation energy, and while it is included to some extent in the
exchange term $E_{12}$, the actual calculation of this term presents considerable
difficulty.

The difficulties arise, of course, from the fact that in the Heitler-London
method the zero-order eigenfunctions for the molecule are built
up from atomic orbitals, that is, single-electron wave functions which
describe the behavior of an electron in the field of only one proton.
It is obvious, however, that, when the atoms combine to form a hydrogen
molecule, these atomic orbitals will be profoundly modified, and it
might therefore appear more logical to start with a system consisting
of two nuclei at a given distance, to which the two electrons are added
in succession. That is, we can regard H2 as formed from the ion H2+ by
the addition of an electron.

14 W. G. Penney, "The Quantum Theory of Valency," p. 18; S. Dushman and
F. Seitz, loc. cit.

This method which has been developed by F. Hund, R. S. Mulliken, 15 
and others is known as that of molecular orbitals. In this method 
the wave function expresses the motion of each electron in the potential 
field resulting from all the nuclei and the other electrons present in the 
molecule. The point of view is therefore analogous to that of Hartree 
in the case of atomic systems. 

A comparison of the results obtained by the two methods has been 
made by Van Vleck and Sherman, and their conclusion is as follows: 16 

It is hard to say categorically whether the method of molecular orbitals or the HL 
method is the better. The latter undoubtedly is much preferable at very large 
distances of separation of the atoms, for then the continual transfer of electronic 
charge from one atom to another demanded by the ionic terms surely scarcely occurs 
at all. On the other hand, at small distances, the HL method probably represents 
excessive fear of the r\2 effect, and the factorization into n one-electron problems 
presupposed by the method of molecular orbitals may be quite a good approxima- 
tion. 

The reference to "ionic terms" requires further amplification, and,
in order to illustrate the significance of this point, we shall consider the
application of the two methods to a diatomic molecule AB in which
each atom contains only one electron which is effective for interaction.

In the molecular orbital method, we represent the eigenfunction for
the molecule by

$$\Psi = \psi(1)\,\psi(2), \qquad (62)$$

where a capital $\Psi$ is used, as Van Vleck and Sherman have suggested,
to denote the wave function for the whole system, and small $\psi$ for the
function of each electron moving in the field of the two nuclei and the
other electron.

Let $\phi_A$ denote the wave function for the motion of one electron in
the field of the nucleus A (so that $\phi_A$ is an atomic orbital), and similarly
for $\phi_B$.

15 R. S. Mulliken, Rev. Modern Phys., 4, 1 (1932). See also J. H. Van Vleck and
A. Sherman, ibid., 7, 167 (1935) and W. G. Penney, op. cit., Chapter III, for a
discussion of this method.

16 Van Vleck and Sherman, op. cit., p. 171.




Then, writing $\psi = \alpha\phi_A + \beta\phi_B$, equation (62) becomes

$$\Psi = \alpha^2\phi_A(1)\phi_A(2) + \beta^2\phi_B(1)\phi_B(2) + \alpha\beta\left[\phi_A(1)\phi_B(2) + \phi_B(1)\phi_A(2)\right]. \qquad (63)$$

It will be noted that a term such as $\phi_A(1)\phi_A(2)$ implies that both
electrons are associated with nucleus A, while $\phi_B(1)\phi_B(2)$ means that
both electrons are associated with B. Therefore, the right-hand side
of (63) represents a condition which may be described as follows:

(1) The system has a probability of being in either or both (depending
upon the ratio $\alpha/\beta$) of the ionic states A: B or A :B. These correspond
to the designations A-B+ and A+B-, respectively.

(2) There is a probability, defined by $\alpha^2\beta^2$, that the molecule will
exist in the homopolar state A : B, in which the bond is non-polar.

The HL argument takes into account only the second possibility and 
neglects completely the possible occurrence of ionic states. 
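The bookkeeping of ionic and homopolar terms may be made explicit by multiplying out the product of two one-electron orbitals term by term. The sketch below assumes each molecular orbital has the form $\alpha\phi_A + \beta\phi_B$; the function name and the string labels for the four product terms are illustrative only.

```python
from itertools import product


def expand_mo_product(alpha, beta):
    """Coefficients of the four products obtained from psi(1) * psi(2),
    where psi = alpha * phi_A + beta * phi_B for each electron."""
    orbital = {"A": alpha, "B": beta}
    terms = {}
    for (site1, c1), (site2, c2) in product(orbital.items(), repeat=2):
        # e.g. "A(1)B(2)": electron 1 on nucleus A, electron 2 on nucleus B
        terms[f"{site1}(1){site2}(2)"] = c1 * c2
    return terms


# Symmetrical molecule: equal weights on the two nuclei.
coefficients = expand_mo_product(2 ** -0.5, 2 ** -0.5)
for label, value in coefficients.items():
    kind = "ionic" if label[0] == label[4] else "homopolar"
    print(f"{label}: {value:.3f} ({kind})")
```

For equal weights the ionic products A(1)A(2) and B(1)B(2) enter with the same coefficient as the homopolar products, which is the feature of the simple molecular orbital function that the HL function lacks.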

Obviously the actual wave function should, in general, be represented
by a combination of ionic and homopolar functions, and for
purposes of further discussion we may write equation (63) in the more
compact form

$$\Psi = a\psi_i + c\psi_{HL}, \qquad (64)$$

where $\psi_i$ designates the wave function for the ionic states, $\psi_{HL}$ that for
the homopolar state (HL form of function), and it is necessary that

$$a^2 + c^2 = 1.$$

From this point of view, the distinction between ionic and homopolar 
compounds is not nearly so sharp as we ordinarily assume. As a in 
equation (64) varies from 1 to 0, we pass from the completely ionic 
to completely homopolar type of molecule. In this connection the 
comments of Van Vleck and Sherman are very pertinent. They write: 

There are elements of truth in the old-fashioned chemistry that HCl has the structure
H+Cl-, as the true wave function of HCl is expressible as a linear combination of
various idealized types, and certainly H+Cl- must be given some representation. . . .
One great service of quantum mechanics is to show very explicitly that all gradations
of polarity are possible, so that in a certain sense it is meaningless to talk of such
idealizations as homopolar bond, heteropolar bond, covalent bond, dative bond, etc. 17

The reader will find further interesting remarks on this topic in
Pauling's paper, 18 "The transition from one extreme bond type to
another," in which he uses as illustrations of such transitions from
ionic to homopolar molecules the alkali and hydrogen halides.

17 Van Vleck and Sherman, op. cit., p. 171. The author is responsible for the
italicized parts.

18 Pauling, J. Am. Chem. Soc., 54, 988 (1932). See also the discussion by
L. Pauling and E. B. Wilson, Jr., "Introduction to Quantum Mechanics," p. 345.

12.7 The Dissociation Energy of H 2 . These considerations have been 
found to be important in the calculation of the energy of formation 
of H2 from the atoms. (This will be designated by D e .) For a sym- 
metrical molecule, such as H 2 , we usually regard the value of a in equa- 
tion (64) as insignificant. However, S. Weinbaum 19 has shown that it 
is possible, by using a function of the form given in this equation, to 
obtain a more accurate value of D e . He assumed 

*B(1)*B(2)] + c 



and similar expressions for the other hydrogen-like functions. This
expression for the wave function was then used in the fundamental relation (49) to
calculate a value of E, subject to the conditions $\partial E/\partial c = 0$ and $\partial E/\partial Z = 0$.

The maximum value for $D_e$ obtained in this manner was 4.00 v.e., with
the values c = 0.256 and the effective nuclear charge Z = 1.193.

E. A. Hylleraas 20 also used the method of molecular orbitals, starting
with wave functions which are solutions for the ionized hydrogen
molecule H2+. The value thus derived for $D_e$ was 3.6 v.e.

Passing now to the consideration of other variational treatments in 
which atomic orbitals have been used, we find that S. C. Wang 21 used 
a modified HL function of the form 



where C is a normalization constant, and Z, as before, represents an 
effective nuclear charge which was determined by solving equation (49), 
subject to the condition dE/dZ = 0. 

A more elaborate expression for the wave function was utilized by N. Rosen. 22 He
took into account the distortion in the charge distribution in each atom
which must occur when the two atoms are brought together (the polarization
effect). He writes:

The simplest way to represent this distortion is to consider the radius of the atom
to change with the distance from the other atom. This is effectively what Wang
did in his calculations, and it led to a definite improvement in the energy value.
However, since the perturbations involved are not spherically symmetrical this cannot
be a very good approximation to the true state of affairs, and the next improvement
that suggests itself is to introduce a change in the wave function that will
depend on the direction with respect to the molecular axis and will be greatest in the
direction of the latter. Since the interactions can be thought of roughly as being
along this axis, it seems likely that the electron cloud tends to bulge out in the
direction of the second atom.

19 S. Weinbaum, J. Chem. Phys., 1, 593 (1933).

20 E. A. Hylleraas, Z. Physik, 71, 739 (1931).

21 S. C. Wang, Phys. Rev., 31, 579 (1928).

22 N. Rosen, ibid., 38, 2099 (1931).

In accordance with this idea, Rosen used a modification of Wang's
"trial function" as defined in the previous equation, of the form

$$\Psi = C\left[e^{-\alpha(r_{A1}+r_{B2})}(1 + \beta r_{A1}\cos\theta_{A1})(1 + \beta r_{B2}\cos\theta_{B2}) + e^{-\alpha(r_{B1}+r_{A2})}(1 + \beta r_{B1}\cos\theta_{B1})(1 + \beta r_{A2}\cos\theta_{A2})\right], \qquad (65)$$

where $\theta_{A1}$ is the angle between $r_{A1}$ and R (the line joining the nuclei),
$\beta$ is a variable parameter, and C is the normalization constant.
Substituting this function in the variation equation (49), the effect was
first determined of varying the effective nuclear charge $\alpha$, assuming
$\beta = 0$. The value of $\alpha$ thus obtained which corresponded to a minimum
value of E was then substituted in equation (65) and used to calculate
a minimum value of E for variations in $\beta$. The final result led to a
value for $D_e$ of 4.02 v.e., with the corresponding values of the two
parameters $\alpha = 1.19$ and $\beta = 0.10$ for the equilibrium distance
$R = 1.416a_0$.

The most accurate solution is that of H. M. James and A. S. Coolidge. 23
The method used by them is similar to that of Hylleraas for helium
(see Chapter XI). They introduced the four elliptic coordinates

$$\lambda_1 = \frac{r_{A1} + r_{B1}}{R}, \quad \lambda_2 = \frac{r_{A2} + r_{B2}}{R}, \quad \mu_1 = \frac{r_{A1} - r_{B1}}{R}, \quad \mu_2 = \frac{r_{A2} - r_{B2}}{R},$$

and the variable $\rho = 2r_{12}/R$, thus taking into account the electron
correlations discussed in the previous section. As trial function they
selected the molecular orbital defined by the series

$$\phi = \sum C_{mnjkp}\,\frac{1}{2\pi}\,e^{-\delta(\lambda_1+\lambda_2)}\left(\lambda_1^m\lambda_2^n\mu_1^j\mu_2^k\rho^p + \lambda_1^n\lambda_2^m\mu_1^k\mu_2^j\rho^p\right), \qquad (66)$$

where the summation extends over all positive values of the exponents
(including zero), "subject to the restriction required by nuclear symmetry
that j + k must be even, and taking as many terms as shall prove
necessary to give an acceptable approximation for the energy."

23 H. M. James and A. S. Coolidge, J. Chem. Phys., 1, 825 (1933).
As a first approximation only the exponential part was used, that is,

$$\phi = C\,e^{-\delta(\lambda_1+\lambda_2)}.$$

For each value of R there is a best value of $\delta$. For the value R = 1.40
atomic units (the value deduced from observations on band spectra)
the lowest value of $E = \int \phi H \phi\, d\tau$ leads to a binding energy of 2.56 v.e.,
which is comparable with that obtained by the Heitler-London function.
As James and Coolidge remark, "In the energy thus calculated, there
is nothing resembling the 'exchange integrals' of the HL treatment;
this raises the question whether the importance of the exchange terms,
frequently assumed to represent the essential nature and magnitude
of chemical binding, may not have been overemphasized."

The fact, thus pointed out, that a form of molecular orbital function
may be chosen that does not lead to integrals of the type $H_{12}$ [see
equations (52)] is of extreme significance for the physical interpretation
of the Heitler-London calculation.

The next approximation made by James and Coolidge is the use of a
series for $\phi$ in which the exponent p in equation (66) is put equal to zero.
This means that the tendency of the electrons to avoid each other is
completely neglected. Under these conditions the maximum value
obtainable for the binding energy for R = 1.40 is about 4.27 v.e., which
leaves about 0.5 v.e. to be accounted for by electron correlations.

The simplest expression in which $\rho$ occurs has the form

$$\phi = e^{-\delta(\lambda_1+\lambda_2)}\left[C_1 + C_2(\mu_1^2 + \mu_2^2) + C_3\mu_1\mu_2 + C_4(\lambda_1 + \lambda_2) + C_5\rho\right],$$

which involves five terms. The exact method used for the determination
of the coefficients is described in the original publication. The
value deduced for the binding energy by the use of this expression for $\phi$
is 4.507 v.e. for R = 1.40, with $\delta = 0.75$. Actually, calculations were
carried out with as many as 13 terms in the series. Table 1, taken from
the paper by James and Coolidge, gives a comparison between their
results and those of previous investigators.

It is of interest to compare these calculated values of $D_e$ with the
results of observations on the energy of dissociation of H2, which we
shall designate by $D_0$. Direct thermochemical determination by
F. R. Bichowsky and L. C. Copeland 24 gave the value $D_0$ = 4.55 ± 0.15
v.e., while H. Beutler 25 has deduced, from the observations on the
vibrational energy levels of H2 in the normal state, the value $D_0$ = 4.45 v.e.
(= 102,700 cal./mole). Since the energy minimum ($-D_e$), as calculated
by the methods of quantum mechanics, includes the zero-point energy
0.27 v.e. (= $\frac{1}{2}h\nu_0$, as shown in the following chapter), the observed value
of $D_e$ is 4.45 + 0.27 = 4.72 v.e. (= 108,900 cal./mole), which is in
extremely satisfactory agreement with that deduced by James and
Coolidge.

24 F. R. Bichowsky and L. C. Copeland, J. Am. Chem. Soc., 50, 1315 (1928).

25 H. Beutler, Z. physik. Chem., B29, 315 (1935).
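The arithmetic connecting the spectroscopic dissociation energy with the depth of the energy minimum is simple enough to set down explicitly. The following is a minimal sketch; the conversion factor of 23,060 cal./mole per v.e. is an assumed modern value, so the figures in cal./mole differ slightly from those quoted above, and the variable names are illustrative.

```python
CAL_PER_MOLE_PER_EV = 23_060.0  # assumed conversion factor (cal./mole per v.e.)

d_0 = 4.45         # observed dissociation energy, v.e.
zero_point = 0.27  # zero-point vibrational energy, v.e.

# Depth of the potential minimum below the separated atoms.
d_e = d_0 + zero_point

print(f"D_0 = {d_0:.2f} v.e. = {d_0 * CAL_PER_MOLE_PER_EV:,.0f} cal./mole")
print(f"D_e = {d_e:.2f} v.e. = {d_e * CAL_PER_MOLE_PER_EV:,.0f} cal./mole")

# Shortfall of the 13-term James and Coolidge value relative to D_e.
shortfall = d_e - 4.698
print(f"13-term function falls short of D_e by {shortfall:.3f} v.e.")
```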

TABLE 1

Function          D_e (electron volts)    R/a_0

1 term                  2.56              1.40
5 terms                 4.507             1.40
11 terms                4.685             1.40
13 terms                4.698             1.40
Without r_12            4.27              1.40
Heitler-London          2.9               1.40
Sugiura                 3.2               1.51
Wang                    3.76              1.42
Rosen                   4.02              1.416
Observed                4.72              1.40



SUPPLEMENTARY NOTE 1 

TRANSFORMATION FROM CARTESIAN TO CONFOCAL 
ELLIPTIC COORDINATES 

The consideration of Figs. 61a and 61b will illustrate the physical
significance of this coordinate system.

If 2a designates the distance between two points regarded as foci
of a system of ellipses and hyperbolas, then the major axis of any one
ellipse is given by

$$\frac{2a}{\epsilon_1},$$

where $\epsilon_1$ ($\leq 1$) is the eccentricity of the given ellipse.

FIG. 61. Confocal elliptic coordinates.

Similarly, for any confocal hyperbola, the major axis is given by

$$\frac{2a}{\epsilon_2},$$

where $\epsilon_2$ ($\geq 1$) is the eccentricity of the hyperbola.

For any point P', at which these two curves intersect,

$$AP' + BP' = r_A + r_B = \frac{2a}{\epsilon_1} = 2a\lambda, \qquad \text{(i)}$$

and

$$AP' - BP' = r_A - r_B = \frac{2a}{\epsilon_2} = 2a\mu, \qquad \text{(ii)}$$

where $\lambda = 1/\epsilon_1$ and $\mu = 1/\epsilon_2$.

Thus, each ellipse of the system of confocal conics has a definite value
of $\lambda$, and similarly, each confocal hyperbola has a definite value of $\mu$, so
that a point P may be designated by specifying the values of $\lambda$ and $\mu$
for the two confocal conics which intersect there.
If now we rotate these curves about AB as axis of revolution, there are
obtained a series of confocal conicoids which in this case are known as
prolate spheroids. The ellipsoids and hyperboloids of revolution will
intersect along circles such as that indicated in Fig. 61b, and hence it is
necessary to introduce a third coordinate variable, the angle $\theta$, which
specifies the position of P on the circumference of the circle, with respect
to a fixed plane (indicated by the line P'NP') which passes through the
axis AB.

It is necessary now to derive the relations between the variables
$\lambda$, $\mu$, $\theta$ and the rectangular variables x, y, z. Assuming that the x-axis
coincides with the axis of revolution AB, that the z-axis is in the plane of
the figure and passes through the origin O, in the midpoint of AB,
while the y-axis projects at right angles to the plane containing Oz and
Ox, we can set down the following relations which are derived in any
textbook on solid geometry.



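$$\frac{x^2}{a^2\lambda^2} + \frac{y^2 + z^2}{a^2(\lambda^2 - 1)} = 1, \qquad \text{(iii)}$$

$$\frac{x^2}{a^2\mu^2} - \frac{y^2 + z^2}{a^2(1 - \mu^2)} = 1. \qquad \text{(iv)}$$

At the point P the same values of x, y, z must satisfy both of these
equations. Hence,

$$r^2 = y^2 + z^2 = a^2(\lambda^2 - 1)(1 - \mu^2).$$

This defines the radius r (= NP) of the circle along which the two
surfaces intersect, and consequently, as is evident from Fig. 61b,

$$y = a\sqrt{(\lambda^2 - 1)(1 - \mu^2)}\,\sin\theta; \qquad \text{(v)}$$

$$z = a\sqrt{(\lambda^2 - 1)(1 - \mu^2)}\,\cos\theta. \qquad \text{(vi)}$$

By substituting in either equation (iii) or (iv), it is readily shown that

$$x = a\lambda\mu. \qquad \text{(vii)}$$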
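The transformation may be verified numerically for any chosen point. The sketch below assumes the relations $x = a\lambda\mu$, $y = a\sqrt{(\lambda^2-1)(1-\mu^2)}\sin\theta$, $z = a\sqrt{(\lambda^2-1)(1-\mu^2)}\cos\theta$, and further assumes that the focus A lies at x = -a and the focus B at x = +a; the function names and the sample values are illustrative.

```python
import math


def elliptic_to_cartesian(a, lam, mu, theta):
    """Rectangular coordinates (x, y, z) of the point (lambda, mu, theta)."""
    radius = a * math.sqrt((lam**2 - 1.0) * (1.0 - mu**2))
    return a * lam * mu, radius * math.sin(theta), radius * math.cos(theta)


def focal_distances(a, x, y, z):
    """Distances from the two foci, assumed to lie at x = -a (A) and x = +a (B)."""
    r_a = math.sqrt((x + a) ** 2 + y**2 + z**2)
    r_b = math.sqrt((x - a) ** 2 + y**2 + z**2)
    return r_a, r_b


# Arbitrary sample point: lambda >= 1, -1 <= mu <= 1.
a, lam, mu, theta = 0.7, 1.8, -0.35, 1.1
x, y, z = elliptic_to_cartesian(a, lam, mu, theta)
r_a, r_b = focal_distances(a, x, y, z)

print(r_a + r_b, 2.0 * a * lam)  # sum of focal distances against 2 a lambda
print(r_a - r_b, 2.0 * a * mu)   # difference of focal distances against 2 a mu
```

Each printed pair should agree to within rounding error, which confirms that the sum and difference of the focal distances are $2a\lambda$ and $2a\mu$ respectively.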

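As in section 6.2, we now have to determine the three constants
$a_\lambda$, $a_\mu$, and $a_\theta$, which connect the elements of volume as expressed in the
two systems of coordinates, according to the relation

$$dx\,dy\,dz = \sqrt{a_\lambda a_\mu a_\theta}\; d\lambda\, d\mu\, d\theta.$$

Since $\lambda$, $\mu$, and $\theta$ form an orthogonal system of coordinates, it follows
that

$$a_\lambda = \left(\frac{\partial s}{\partial \lambda}\right)^2, \qquad \text{(viii)}$$

$$a_\mu = \left(\frac{\partial s}{\partial \mu}\right)^2, \qquad \text{(ix)}$$

$$a_\theta = \left(\frac{\partial s}{\partial \theta}\right)^2, \qquad \text{(x)}$$

where $(ds)^2 = (dx)^2 + (dy)^2 + (dz)^2$.

These differential coefficients, as derived from equations (v), (vi),
and (vii), are as follows.

$$\frac{\partial x}{\partial \lambda} = a\mu; \quad \frac{\partial y}{\partial \lambda} = a\lambda\sqrt{\frac{1 - \mu^2}{\lambda^2 - 1}}\,\sin\theta; \quad \frac{\partial z}{\partial \lambda} = a\lambda\sqrt{\frac{1 - \mu^2}{\lambda^2 - 1}}\,\cos\theta; \qquad \text{(xi)}$$

$$\frac{\partial x}{\partial \mu} = a\lambda; \quad \frac{\partial y}{\partial \mu} = -a\mu\sqrt{\frac{\lambda^2 - 1}{1 - \mu^2}}\,\sin\theta; \quad \frac{\partial z}{\partial \mu} = -a\mu\sqrt{\frac{\lambda^2 - 1}{1 - \mu^2}}\,\cos\theta; \qquad \text{(xii)}$$

$$\frac{\partial x}{\partial \theta} = 0; \quad \frac{\partial y}{\partial \theta} = a\sqrt{(\lambda^2 - 1)(1 - \mu^2)}\,\cos\theta; \quad \frac{\partial z}{\partial \theta} = -a\sqrt{(\lambda^2 - 1)(1 - \mu^2)}\,\sin\theta. \qquad \text{(xiii)}$$

Hence,

$$a_\lambda = \frac{a^2(\lambda^2 - \mu^2)}{\lambda^2 - 1}, \qquad \text{(xiv)}$$

$$a_\mu = \frac{a^2(\lambda^2 - \mu^2)}{1 - \mu^2}, \qquad \text{(xv)}$$

$$a_\theta = a^2(\lambda^2 - 1)(1 - \mu^2), \qquad \text{(xvi)}$$

and

$$dx\,dy\,dz = a^3(\lambda^2 - \mu^2)\, d\lambda\, d\mu\, d\theta. \qquad \text{(xvii)}$$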
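The same result follows from the relation in terms of the Jacobian,
which is of the form

$$dx\,dy\,dz = \begin{vmatrix} \dfrac{\partial x}{\partial \lambda} & \dfrac{\partial y}{\partial \lambda} & \dfrac{\partial z}{\partial \lambda} \\[2mm] \dfrac{\partial x}{\partial \mu} & \dfrac{\partial y}{\partial \mu} & \dfrac{\partial z}{\partial \mu} \\[2mm] \dfrac{\partial x}{\partial \theta} & \dfrac{\partial y}{\partial \theta} & \dfrac{\partial z}{\partial \theta} \end{vmatrix}\, d\lambda\, d\mu\, d\theta.$$

As may be verified by direct substitution from equations (xi), (xii),
and (xiii), the value of the determinant is found to be $a^3(\lambda^2 - \mu^2)$.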
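The value of the determinant can also be checked without any algebra by estimating the nine partial derivatives with finite differences. The following is a minimal sketch, assuming the transformation $x = a\lambda\mu$, $y = a\sqrt{(\lambda^2-1)(1-\mu^2)}\sin\theta$, $z = a\sqrt{(\lambda^2-1)(1-\mu^2)}\cos\theta$; the step size, sample point, and function names are illustrative choices.

```python
import math


def elliptic_to_cartesian(a, lam, mu, theta):
    """Rectangular coordinates (x, y, z) of the point (lambda, mu, theta)."""
    radius = a * math.sqrt((lam**2 - 1.0) * (1.0 - mu**2))
    return a * lam * mu, radius * math.sin(theta), radius * math.cos(theta)


def jacobian_determinant(a, lam, mu, theta, h=1e-6):
    """Determinant of d(x, y, z)/d(lambda, mu, theta) by central differences."""
    point = [lam, mu, theta]
    rows = []
    for i in range(3):
        plus, minus = list(point), list(point)
        plus[i] += h
        minus[i] -= h
        f_plus = elliptic_to_cartesian(a, *plus)
        f_minus = elliptic_to_cartesian(a, *minus)
        # Row i holds the derivatives of x, y, z with respect to coordinate i.
        rows.append([(p - m) / (2.0 * h) for p, m in zip(f_plus, f_minus)])
    (r11, r12, r13), (r21, r22, r23), (r31, r32, r33) = rows
    return (r11 * (r22 * r33 - r23 * r32)
            - r12 * (r21 * r33 - r23 * r31)
            + r13 * (r21 * r32 - r22 * r31))


a, lam, mu, theta = 0.7, 1.8, -0.35, 1.1
numerical = jacobian_determinant(a, lam, mu, theta)
closed_form = a**3 * (lam**2 - mu**2)
print(numerical, closed_form)
```

The two printed numbers should agree to several decimal places, in accordance with the volume element $a^3(\lambda^2 - \mu^2)\,d\lambda\,d\mu\,d\theta$.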

Applying the rules stated in section 6.2 (see also Appendix IV) for
expressing the Laplacian operator in terms of the variables $\lambda$, $\mu$, and $\theta$,
it is readily deduced that

$$\nabla^2 = \frac{1}{a^2(\lambda^2 - \mu^2)}\left\{\frac{\partial}{\partial \lambda}\left[(\lambda^2 - 1)\frac{\partial}{\partial \lambda}\right] + \frac{\partial}{\partial \mu}\left[(1 - \mu^2)\frac{\partial}{\partial \mu}\right] + \frac{\lambda^2 - \mu^2}{(\lambda^2 - 1)(1 - \mu^2)}\frac{\partial^2}{\partial \theta^2}\right\}. \qquad \text{(xviii)}$$

When the S. equation for a two-center system is expressed in terms
of these variables it is usually much more convenient to separate the
equation into three ordinary differential equations, and thus obtain a
solution, than when other systems of variables are used. 26

26 See discussion by H. Bethe in "Handbuch der Physik," XXIV, Part 1, p. 530
et seq. A further discussion of the usefulness of elliptic coordinates in the
treatment of the two-center problem will be found in the paper by W. G. Barber and
H. R. Hassé, Proc. Cambridge Phil. Soc., 31, 564 (1935).



CHAPTER XIII 

VIBRATIONAL AND ROTATIONAL STATES OF THE 
HYDROGEN MOLECULE 

13.1 General Remarks. 1 The existence of band spectra, and the 
observations on the variation in specific heats of gases with temper- 
ature, lead to the conclusion that, in addition to the energy of excita- 
tion of electronic levels, the molecules also possess both vibrational and 
rotational energy due to the motions of the nuclei. Since the fre- 
quencies of these motions are small compared to those of the electrons, 
the electronic motion can adjust itself relatively instantaneously to the 
motion of the nuclei as if the latter were centers of force at rest. 

It is therefore possible to consider the energy states arising from 
nuclear motions as superimposed upon the electronic states. 

Let us consider a diatomic molecule, such as H2. As shown in the
previous chapter, the potential energy for any two atoms A and B,
which combine to form a molecule AB, is a function of the internuclear
distance r, which may be represented graphically by a curve such as
that shown in Fig. 62. Let U(r) designate this function. By definition,
the force between the nuclei is given by

$$F(r) = -\frac{dU}{dr}.$$

At the value of $r = r_0$, for which U is a minimum, the force vanishes,
while it is negative to the right of the minimum (corresponding to a net
force of attraction) and positive to the left of the minimum (corresponding
to a net force of repulsion). If we assume U(r) = 0 for $r = \infty$,
then the energy at $r = r_0$ (which is negative) corresponds to the
dissociation energy which is usually designated by D.

For values of U(r) > -D, there are two values of r at which U(r)
has the same value. These correspond to mean points of equilibrium
for the vibrational motion of the nuclei, and the total energy is given by
$E_v = U(r) + D$. This energy is quantized, so that there exist a series
of vibrational energy states, designated by the quantum numbers
v = 0, 1, 2, 3, etc., between the values U(r) = -D and U(r) = 0.
Figure 62 shows such a series of vibrational energy states for the molecule
H2 in the normal ($1s\,{}^1\Sigma$) state, and the potential energy curve
derived in accordance with Morse's equation, as explained in the following
section.

1 See references at end of chapter.

It is also observed that with any given vibrational state there are
associated a series of rotational energy states, designated by the quantum
numbers K = 0, 1, 2, etc. Thus, any line in the emission spectrum
of a molecule may be represented as a transition from an upper rotational
level K', associated with the upper vibrational level v' and with a
higher electronic state, to a lower rotational level K'', which is associated
with a vibrational level v'' and a lower electronic state. That
is, if $\nu$ represents the wave number 2 of the emitted line,

$$\nu = \nu_e + G'_v - G''_v + F'(v, K) - F''(v, K), \qquad (1)$$

where $\nu_e$ denotes the wave-number difference between the lowest levels
associated with v' = v'' = 0 and K' = K'' = 0, $G'_v$ corresponds to the
vibrational energy, and F'(v, K) to the rotational energy associated with
the higher electronic state, and similarly for $G''_v$ and F''(v, K).

FIG. 62. Potential energy function and vibrational energy levels for normal
state of H2 molecule. (Abscissa: internuclear distance in Å.)

2 It is customary to give energy differences in terms of wave numbers. See
Appendix II for conversion factors to other units.

Thus Fig. 63 shows the lower vibrational energy levels for the molecule
H2 in the normal ($1s\,{}^1\Sigma$) and in the first excited ($2p\,{}^1\Sigma$) states. 3
The latter state results from the interaction of one electron in the
normal (1s) state and another in the 2s state. The difference in energy
for the states v = 0, K = 0 is equal to 90,201 cm.$^{-1}$ (= 11.13 v.e.),
and the spectrum emitted by transitions from the higher to the normal
state lies in the ultraviolet region, as shown by the values of $\lambda$ (in
Ångstrom units) which are attached to the different lines.

FIG. 63. Normal and first excited electronic energy levels for H2 molecule and
associated vibrational energy levels.

Figure 64 shows the rotational energy levels associated with the level,
v' = 3, in the upper electronic state, and the level, v'' = 1, in the
normal state of the H2 molecule. The values of $\nu$ on the right-hand side
are taken from the corresponding values in Fig. 63, 4 while the values
under $\Delta$ give the total increase in rotational energy (in terms of wave
numbers) as the quantum number K is increased from 0 to higher values.
The transitions indicated are those corresponding to the line L in Fig. 63,
where K' designates the upper levels and K'' the lower levels, respectively.
It will be seen that the "head" of the band, corresponding to
the transition from K' = 0 to K'' = 0, has the value $\nu$ = 89896 (or
$\lambda$ = 1112 Å). This is shown in Fig. 63 as the third line from the left.

FIG. 64. Illustrating transitions between rotational energy levels associated with
two different vibrational energy levels for H2 molecule.

3 The values of the energy levels for H2 indicated in this and the other figures are
taken from the extremely interesting paper by C. R. Jeppesen, Phys. Rev., 44, 165
(1933), "The Emission Spectrum of Molecular Hydrogen in the Extreme Ultraviolet."

4 The difference between the values 4162 and 4157 for $\nu$ of the level v'' = 1 is of
no significance in the present connection.

13.2 Vibrational Energy States. For the H2 molecule in the normal
state, the equilibrium internuclear distance is $r_0$ = 0.74 Å. As the
molecule acquires increasing amounts of vibrational energy (due to
increase in temperature of the gas), this internuclear distance increases.
Assuming that the restoring force acting on each atom in a diatomic
molecule AB is proportional to the displacement from the position of
equilibrium for minimum value of U(r) (Hooke's law), it is possible
to derive relations for both the potential energy as a function of r and
the frequency of vibration. 5

We shall consider the general case of a diatomic molecule consisting
of atoms A and B. Let $\mu_A$ and $\mu_B$ denote the masses of the atoms;
let r denote the distance between the two nuclei for any given vibrational
state, and let $r_0$ designate the value of r for U(r) = -D.

5 R. de L. Kronig, "The Optical Basis of the Theory of Valency," p. 83.
Then, the equations of motion for the two nuclei are as follows (see
Fig. 65):

$$k(r - r_0) = -\mu_A\ddot{x}_1 = \mu_B\ddot{x}_2.$$

FIG. 65. Illustrating the vibrational motion of two nuclei.

Hence,

$$\ddot{x}_1 - \ddot{x}_2 = -k(r - r_0)\left(\frac{1}{\mu_A} + \frac{1}{\mu_B}\right).$$

That is,

$$\ddot{r} = -\frac{k}{\mu}(r - r_0), \qquad (2)$$

where $\mu = \dfrac{\mu_A\mu_B}{\mu_A + \mu_B}$ = "reduced" mass of molecule.

The solution of this equation is

$$r - r_0 = A\sin(2\pi\nu_0 t + \delta),$$

where A is the maximum amplitude, $\delta$ is the phase angle, and

$$\nu_0 = \frac{1}{2\pi}\sqrt{\frac{k}{\mu}}. \qquad (3)$$

The potential energy function corresponding to equation (2) is evidently
a parabola with vertex at $r = r_0$, the equation for which is

$$U(r) = -D + \tfrac{1}{2}k(r - r_0)^2.$$

Substituting from equation (3), this can be written in the form

$$U(\rho) = -D + 2(\pi\omega_0)^2\mu\rho^2, \qquad (4)$$

where $\rho = r - r_0$, and $\nu_0$ is replaced by $\omega_0$, in accordance with the
notation used in the literature on band spectra.
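The relations among the reduced mass, the force constant, and the frequency of vibration can be illustrated with a short calculation. The sketch below assumes the harmonic relations $\nu_0 = (1/2\pi)\sqrt{k/\mu}$ and $U = -D + \tfrac{1}{2}k(r - r_0)^2$; the numerical values of the atomic mass and the force constant are hypothetical, chosen only to show the order of magnitude, and the function names are illustrative.

```python
import math


def reduced_mass(m_a, m_b):
    """Reduced mass of a pair of nuclei of masses m_a and m_b."""
    return m_a * m_b / (m_a + m_b)


def vibration_frequency(k, mu):
    """Classical frequency nu_0 = (1 / 2 pi) * sqrt(k / mu)."""
    return math.sqrt(k / mu) / (2.0 * math.pi)


def force_constant(nu_0, mu):
    """Force constant recovered from the frequency: k = 4 pi^2 nu_0^2 mu."""
    return 4.0 * math.pi**2 * nu_0**2 * mu


def harmonic_potential(r, r_0, k, d):
    """Parabolic potential with vertex at r = r_0 and depth -D."""
    return -d + 0.5 * k * (r - r_0) ** 2


# Hypothetical c.g.s. values for a homonuclear molecule of two light atoms.
ATOM_MASS = 1.67e-24      # grams (assumed)
FORCE_CONSTANT = 5.7e5    # dynes per cm (assumed)

mu = reduced_mass(ATOM_MASS, ATOM_MASS)
nu_0 = vibration_frequency(FORCE_CONSTANT, mu)
print(f"reduced mass = {mu:.3e} g")
print(f"frequency    = {nu_0:.3e} per second")
```

For two equal masses the reduced mass is one-half the mass of either atom, so that the frequency exceeds by the factor $\sqrt{2}$ that of a single atom bound to a fixed center by the same force.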

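This is the function which must be inserted in the S. equation for the
system in order to determine the discrete energy states. This equation,
which corresponds to equation (7.7), has the form

$$\frac{1}{\rho^2}\frac{d}{d\rho}\left(\rho^2\frac{dS}{d\rho}\right) + \left\{\alpha^2\left[E - U(\rho)\right] - \frac{K(K + 1)}{\rho^2}\right\}S = 0, \qquad (5)$$

where $S = S(\rho)$ is the "radial" function, K = 0, 1, 2, etc., and
$\alpha^2 = 8\pi^2\mu/h^2$.

Put $S(\rho) = \dfrac{1}{\rho}\phi(\rho)$. Then equation (5) becomes

$$\frac{d^2\phi}{d\rho^2} + \left\{\alpha^2\left[E - U(\rho)\right] - \frac{K(K + 1)}{\rho^2}\right\}\phi = 0. \qquad (6)$$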

If we substitute for U(p) from equation (4), and let K = 0, that is, 
consider only the case in which there is zero rotational energy, and only 
vibrational energy, equation (6) becomes 6 



> 

* + a *{E + D - 2(o) V}* - 0. (7) 

ap 

This equation is evidently similar to equation (5.5) for the linear
harmonic oscillator, and has as eigenvalues the series defined by the
relation

$$E_v = -D + h\omega_0\left(v + \tfrac{1}{2}\right), \qquad (8)$$

where $v = 0, 1, 2$, etc., is the vibrational quantum number.

It follows that the vibrational energy levels should be equally spaced.
Actually, the distance between successive energy levels decreases, in
the case of homopolar diatomic molecules, with increase in $v$. That is,
the atoms do not behave as simple harmonic oscillators. The motion
is said to be of the anharmonic type, with the result that, to a first
approximation, the vibrational energy must be represented by an
expression of the form

$$E_v = -D + h\omega_0\left(v + \tfrac{1}{2}\right) - xh\omega_0\left(v + \tfrac{1}{2}\right)^2, \qquad (9)$$

where $x$ is a constant for any given molecule.

For $E_v = 0$, that is, when dissociation occurs, it is evident that
$dE_v/dv = 0$. Consequently,

$$h\omega_0 - 2xh\omega_0\left(v + \tfrac{1}{2}\right) = 0,$$

that is,

$$v + \tfrac{1}{2} = \frac{1}{2x},$$

6 The solution discussed in the following section is that given by P. M. Morse,
Phys. Rev., 34, 57 (1929); also in Condon and Morse, "Quantum Mechanics,"
Chapter V.

and

$$D = \frac{h\omega_0}{2x} - \frac{h\omega_0}{4x} = \frac{h\omega_0}{4x}, \qquad (10a)$$

or

$$D = \frac{\omega_0^2}{4x\omega_0}. \qquad (10b)$$

The right-hand side of this equation is expressed in such a form as to
give the value for the dissociation energy in terms of wave numbers.
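As a numerical illustration of equations (9) and (10b), the following sketch evaluates the spacing of successive levels and the first-approximation dissociation energy. It assumes the wave-number values $\omega_0 = 4352.2$ and $2x\omega_0 = 262.63$ cm$^{-1}$ quoted later in this section for H2; the function names are illustrative only and do not come from the text.

```python
# Sketch of equations (9) and (10b) in wave-number units (cm^-1).
# Assumed inputs: the H2 values quoted later in this section.
OMEGA0 = 4352.2        # omega_0, cm^-1
TWO_X_OMEGA0 = 262.63  # 2 * x * omega_0, cm^-1


def level(v, omega0=OMEGA0, two_x_omega0=TWO_X_OMEGA0):
    """E_v + D from equation (9): energy above the potential minimum."""
    x_omega0 = two_x_omega0 / 2.0
    return omega0 * (v + 0.5) - x_omega0 * (v + 0.5) ** 2


def spacing(v):
    """Separation between levels v and v + 1; it falls linearly with v."""
    return level(v + 1) - level(v)


def dissociation_energy(omega0=OMEGA0, two_x_omega0=TWO_X_OMEGA0):
    """Equation (10b): D = omega_0**2 / (4 * x * omega_0)."""
    return omega0 ** 2 / (2.0 * two_x_omega0)


if __name__ == "__main__":
    for v in range(4):
        print(v, round(spacing(v), 2))
    print("D (cm^-1):", round(dissociation_energy()))
```

The spacing decreases by $2x\omega_0$ for each unit increase in $v$, which is the behavior that distinguishes equation (9) from the equally spaced levels of equation (8).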
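Hence, to a first approximation,

$$E_v = -D + h\omega_0\left(v + \tfrac{1}{2}\right) - \frac{h^2\omega_0^2}{4D}\left(v + \tfrac{1}{2}\right)^2. \qquad (11)$$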

As shown by Morse, a potential energy function which is in agreement
with this relation and with the requirement that $U(\rho)$ shall be a
minimum for $\rho = 0$ may be chosen of the form

$$U(\rho) = -2De^{-\beta\rho} + De^{-2\beta\rho}, \qquad (12)$$

where $\beta$ is obtained as follows.
For small values of $\rho$, the last equation becomes

$$U(\rho) = -D + D\beta^2\rho^2.$$

This evidently has a minimum value $U(\rho) = -D$ for $\rho = 0$, that is,
the minimum occurs at $r = r_0$.

Comparing this with equation (4) it follows that

$$D\beta^2 = 2(\pi\omega_0)^2\mu,$$

and substituting for $D$ from equation (10a),

$$\beta = \sqrt{\frac{8\pi^2\mu\omega_0 x}{h}} = 0.2454\sqrt{M\omega_0 x}, \qquad (13)$$

where

$M = \mu \times 6.064 \times 10^{23}$
= reduced mass in terms of oxygen (atomic mass = 16).

In the numerical form, $\omega_0 x$ is expressed in wave numbers and $\beta$ in
reciprocal Ångström units.

Now C. R. Jeppesen 7 found that, for values of the vibrational quantum
number from $v = \tfrac{1}{2}$ to $v = 5\tfrac{1}{2}$, the difference in wave
numbers between successive levels is given by the relation

$$\Delta G_v = 4417.19 - 262.63\left(v + \tfrac{1}{2}\right) + 9.34619\left(v + \tfrac{1}{2}\right)^2 - 0.76\left(v + \tfrac{1}{2}\right)^3. \qquad (14)$$

The values of $\Delta G_v$ thus calculated are shown in the column on the
right-hand side of Fig. 62. For $v = \tfrac{1}{2}$, $\Delta G_v = 4161.70$, and it
will be observed that the value of $\Delta G_v$ decreases with increase in $v$.

7 C. R. Jeppesen, loc. cit.

More recently H. Beutler 8 has been able to determine, from
observations on the lines in the extreme ultraviolet spectrum, values of
$\Delta G_v$ for $v = 12$, 13, and 14. If the values of $\Delta G_v$ are plotted as
ordinates against $\sum_{v=0}^{v} \Delta G_v = \tilde{\nu}$, the total increase in
wave number from $v = 0$ to $v$, the resulting curve (see Fig. 66) shows
that $\Delta G_v = 0$ for $\tilde{\nu} = 36,116$ cm.$^{-1}$. Therefore, this value must
correspond to the dissociation energy $D_0$ of H2 in the normal state.
In terms of energy units,



$$D_0 = \frac{36,116}{8106} = 4.455 \text{ v.e.} = 4.455 \times 23,055 = 102,700 \text{ cal./mole}.$$

FIG. 66. Plot of $\Delta G_v$ and internuclear distance ($r_0$) as functions of the
wave numbers of the vibrational levels for the normal state of H2.

From equation (14) it follows that, for $v = -\tfrac{1}{4}$, $\Delta G = 4352.2$. 9 This
corresponds to twice the zero-point energy ($h\nu_0/2$), and therefore
one-half this quantity must be added to $D_0$ to give $D$, the minimum value
of the potential energy $U(\rho)$. Thus, $D = 36,116 + 2176 = 38,292$ cm.$^{-1}$ =
108,900 cal./mole.
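The unit conversions in the preceding paragraphs can be retraced with a few lines of arithmetic. The sketch below takes the conversion factors exactly as printed (8106 wave numbers and 23,055 cal./mole per volt-electron); the function and constant names are illustrative assumptions.

```python
# Conversion factors as used in the text (taken as given).
CM1_PER_VE = 8106.0    # wave numbers (cm^-1) per volt-electron
CAL_PER_VE = 23055.0   # cal./mole per volt-electron


def wavenumber_to_ve(nu):
    """Convert a wave number (cm^-1) to volt-electrons."""
    return nu / CM1_PER_VE


def wavenumber_to_cal_per_mole(nu):
    """Convert a wave number (cm^-1) to cal./mole."""
    return wavenumber_to_ve(nu) * CAL_PER_VE


D0 = 36116.0               # cm^-1, from the extrapolation to Delta G = 0
ZERO_POINT = 4352.2 / 2.0  # cm^-1, one-half of the extrapolated Delta G
D = D0 + ZERO_POINT        # depth of the potential minimum, cm^-1

if __name__ == "__main__":
    print(round(wavenumber_to_ve(D0), 3))
    print(round(wavenumber_to_cal_per_mole(D0), -2))
    print(round(D), round(wavenumber_to_cal_per_mole(D), -2))
```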

8 H. Beutler, Z. physik. Chem., B27, 287 (1934); ibid., B29, 315 (1935).

9 According to Jeppesen the extrapolation to $v = -\tfrac{1}{4}$ gives the best
approximation to the value of $\omega_0$.

For H2 ($M = 1.008$), the value of $\beta$ according to equation (13) is
2.00. Hence the potential energy function for H2, as derived from the
spectroscopic vibrational energy levels, is given, in terms of wave
numbers, by the relation

$$U(\rho) = 38,292\,(e^{-4\rho} - 2e^{-2\rho}). \qquad (15)$$

The following table gives values of $U(\rho)$, in wave numbers, for
different values of $r = \rho + 0.74$, in Ångström units.

 r (Å)      U(ρ)        r (Å)      U(ρ)

 0.34     +18,997       1.14     -26,660
 0.44     -12,265       1.34     -19,578
 0.54     -29,062       1.54     -13,906
 0.64     -36,354       1.74      -9,664
 0.74     -38,291       1.94      -6,633
 0.84     -37,075       2.34      -3,061
 0.94     -34,123       2.74      -1,390
 1.04     -30,532       3.74        -192
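The tabulated values can be regenerated approximately from equation (15). The sketch below is a minimal evaluation of the Morse function, assuming the rounded constants $D = 38,292$ cm$^{-1}$, $\beta = 2.00$ Å$^{-1}$, and $r_0 = 0.74$ Å; the names are illustrative.

```python
import math

D_E = 38292.0   # depth of the minimum, cm^-1 (equation 15)
BETA = 2.00     # reciprocal Angstrom units (rounded value from the text)
R0 = 0.74       # equilibrium separation, Angstrom


def morse(r, d=D_E, beta=BETA, r0=R0):
    """U(rho) = D * (exp(-2*beta*rho) - 2*exp(-beta*rho)), rho = r - r0."""
    rho = r - r0
    return d * (math.exp(-2.0 * beta * rho) - 2.0 * math.exp(-beta * rho))


if __name__ == "__main__":
    for r in (0.34, 0.54, 0.74, 1.14, 1.74, 3.74):
        print(f"{r:4.2f}  {morse(r):10.0f}")
```

With these rounded constants the computed values agree with the table to within a per cent or so; the small differences presumably reflect the rounding of $\beta$ to 2.00.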

Figure 62 was plotted from these data. The ordinates give the
values of $U(\rho)$ both in terms of $\tilde{\nu}$, the wave number, and in terms
of $V$, the corresponding energy in electron volts. The horizontal lines
give the vibrational energy levels corresponding to the different values
$v = 0$ to $v = 12$, indicated on the curve. To distinguish $D$ from $D_0$,
the symbol $D_e$ is customary (as mentioned in the previous chapter) for
the former, which evidently has only a theoretical interest since $D_0$ is
the experimentally determined value.

It will be observed that, according to Jeppesen's data, $2x\omega_0$ and $\omega_0$,
expressed in wave numbers, are equal to 262.63 and 4352.2 respectively.
Hence, if we use equation (10b),

$$D = \frac{(4352.2)^2}{2 \times 262.63} \approx 36,060 \text{ cm.}^{-1},$$

which is less than the value determined by direct extrapolation to
$\Delta G_v = 0$. As first shown by R. T. Birge and H. Sponer, 10 the latter
method is the more accurate one for the determination of $D_0$.

A potential energy curve similar to that for the normal state can also
be plotted for any of the excited states for which sufficient data on the
vibrational energy levels are available. According to Jeppesen, the
values of $\Delta G_v$ for the $2p\,^1\Sigma$ state are given by the relation

$$\Delta G_v = 1357.302 - 39.9307\left(v + \tfrac{1}{2}\right) + 1.218487\left(v + \tfrac{1}{2}\right)^2 - 0.0638888\left(v + \tfrac{1}{2}\right)^3,$$

while the value corresponding to twice the zero-point energy is
$\Delta G = 1347.4$ cm.$^{-1}$. Since the value of $2x\omega_0$ is 39.931, the
calculated value of $D_e$ is $(1347.4)^2/79.862 = 22,860$ cm.$^{-1}$.

10 R. T. Birge and H. Sponer, Phys. Rev., 28, 259 (1926).
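The two extrapolated values quoted for twice the zero-point energy can be checked against Jeppesen's polynomials. The sketch below assumes that both polynomials are in powers of $(v + \tfrac{1}{2})$ and that the extrapolation point is $v = -\tfrac{1}{4}$; both are readings of the printed relations rather than statements confirmed elsewhere in the text, and the names are illustrative.

```python
def delta_g(v, coeffs):
    """Evaluate a0 - a1*y + a2*y**2 - a3*y**3 with y = v + 1/2 (cm^-1)."""
    a0, a1, a2, a3 = coeffs
    y = v + 0.5
    return a0 - a1 * y + a2 * y ** 2 - a3 * y ** 3


# Coefficients as printed for the normal state (equation 14) and for the
# excited state discussed at the end of this section.
NORMAL = (4417.19, 262.63, 9.34619, 0.76)
EXCITED = (1357.302, 39.9307, 1.218487, 0.0638888)

# Assumed extrapolation point: v = -1/4, that is, y = 1/4.
two_zero_point_normal = delta_g(-0.25, NORMAL)
two_zero_point_excited = delta_g(-0.25, EXCITED)

if __name__ == "__main__":
    print(round(two_zero_point_normal, 1))
    print(round(two_zero_point_excited, 1))
```

Under these assumptions the polynomials return values within about 0.1 cm$^{-1}$ of the 4352.2 and 1347.4 cm$^{-1}$ quoted in the text.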

13.3 Rotational Energy Levels. Let us now consider the solution
of equation (6) for the case $K \neq 0$. A solution of this equation has
been given by C. L. Pekeris, 11 leading to an expression for the energy
levels in which the effects of rotational and vibrational energies are
combined. However, it is possible to indicate the significance of this
more complicated expression without the formal mathematical
derivation.

In equation (6.25) it was shown that in a rotating diatomic molecule
with fixed moment of inertia $I$ the rotational energy levels are given
by the relation

$$E_r = \frac{h^2}{8\pi^2 I}K(K + 1), \qquad (16)$$

where $K$ is the rotational quantum number, and $I$ is defined by the
relation

$$I = \mu r^2,$$

where $\mu$ is the reduced mass.

Equation (16) is usually written in the form

$$E_r = BK(K + 1), \qquad (17)$$

where

$$B = \frac{h^2}{8\pi^2 I}. \qquad (18)$$

In the case of H2, there are two atoms, each of mass $\mu_H = 1.662 \times
10^{-24}$ gm. In the state $v = 0$, the internuclear distance is $0.74 \times
10^{-8}$ cm. Hence,

$I = 4.55 \times 10^{-41}$ gm. cm.$^2$,

$B = 1.19 \times 10^{-14}$ gm. cm.$^2$ sec.$^{-2}$,

and $B_v = \dfrac{B}{hc} = 60.65$ cm.$^{-1}$.

he 

The latter is to be compared with the value 59.4 derived from the
plot in Fig. 66. With increase in vibrational quantum number, $B_v$
decreases, as shown in this figure, and $r$ increases in accordance with the
relation

$$r^2 = \frac{3.32 \times 10^{-15}}{B_v}.$$

The notation $B_v$ is used to indicate that the constant is measured in
terms of wave numbers and also that the value is a function of the
vibrational quantum number.

11 C. L. Pekeris, Phys. Rev., 46, 98 (1934). See also L. Pauling and E. B. Wilson, Jr.,
"Introduction to Quantum Mechanics," Chapter X, for detailed discussion of this
case.
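The numerical values of $I$, $B$, and $B_v$ for H2 follow directly from equations (16) to (18). The sketch below repeats the arithmetic; it uses the atomic mass and internuclear distance given in the text but, as an assumption, modern values of $h$ and $c$, so that the results differ slightly from the printed figures. The function names are illustrative.

```python
import math

H = 6.626e-27      # Planck constant, erg sec (modern value, an assumption)
C = 2.998e10       # velocity of light, cm/sec (modern value, an assumption)
M_H = 1.662e-24    # mass of the hydrogen atom, gm (value used in the text)


def reduced_mass(m_a, m_b):
    """mu = m_a * m_b / (m_a + m_b)."""
    return m_a * m_b / (m_a + m_b)


def moment_of_inertia(mu, r):
    """I = mu * r**2, with r in cm."""
    return mu * r ** 2


def rotational_constant_erg(inertia):
    """B = h**2 / (8 * pi**2 * I), equation (18), in ergs."""
    return H ** 2 / (8.0 * math.pi ** 2 * inertia)


def rotational_constant_wavenumber(inertia):
    """B_v = B / (h * c), in cm^-1."""
    return rotational_constant_erg(inertia) / (H * C)


I_H2 = moment_of_inertia(reduced_mass(M_H, M_H), 0.74e-8)

if __name__ == "__main__":
    print(I_H2)
    print(rotational_constant_erg(I_H2))
    print(rotational_constant_wavenumber(I_H2))
```

With these constants $B_v$ comes out between 61 and 62 cm$^{-1}$, a little above the 60.65 cm$^{-1}$ quoted, presumably because the text uses the values of the constants current at the time.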

According to equation (16), the levels should be equally spaced, if $I$
were a constant; but, as in the case of the vibrational levels, not only
does $I$ vary with $v$, but also, for a given value of $v$, the spacing between
rotational levels varies with change in $K$. Thus, in the case of H2 and
many other molecules, the value of $\Delta F$, the increase in wave number
for $\Delta K = 1$, increases with increase in $K$, as is evident from Fig. 64.
Consequently, the actual observations on rotational energy levels lead
to the semi-empirical relation

$$F = \ldots + \text{higher powers of } \left(K + \tfrac{1}{2}\right), \qquad (19)$$

where $F$ is expressed in terms of wave numbers.

The first two terms on the right-hand side of this equation evidently 
correspond to the expression in equation (17), while the following 
terms are due to the effect of the vibrational energy. The expression 
deduced by Pekeris, which expresses the combined effects of rotational 
and vibrational energy, is: 12 



+ I) 2 - 70 v + K(K + 1), 

where 

. . . [see equation (10a)]; 



4D 



B v = 2 . . . [see equation (18)]; 
and $\beta_0$ and $\gamma_0$ are constants, the values of which depend on those of
$r_0$, $\omega_0$, $\mu$, and $D$.

In Fig. 64 the values under $\Delta\tilde{\nu}$ give the difference in wave number
of any given level with respect to the imaginary level designated by
a dashed line, while $\tilde{\nu}$ gives the wave number of this lowest level with
respect to the level $\tilde{\nu} = 0$ for the normal H2 molecule in state $v = 0$,
$K = 0$. It will be observed that the differences between successive
levels in both sets increase with increase in $K$, and that these differences
for any pair of values of $K'$ (the upper values) are less than for the
corresponding pair in the set $K''$ (lower levels).

12 L. Pauling and E. B. Wilson, Jr., op. cit., p. 274.

Although it is possible to have transitions between any pair of
vibrational levels, as shown in Fig. 63, this is not valid for rotational
levels. According to the selection principle for transitions, only those
transitions may occur for which $\Delta K = \pm 1$. The lines corresponding to
the transition from $K'$ to $K'' = K' + 1$ constitute the P-branch, while
those corresponding to the transition from $K'$ to $K'' = K' - 1$
constitute the R-branch. In some cases, for reasons which we need not
discuss here, lines are obtained for which $K'' = K'$. These then
constitute the Q-branch.

From equation (19) it is seen that the difference between the level
$K = 0$ and the dashed level immediately below it, in Fig. 64, is equal
to $\tfrac{1}{4}B_v$. Hence for the state $v' = 1$, $B'_v = 4 \times 14.1 = 56.4$ cm.$^{-1}$,
while for the level $v'' = 3$ in the upper electronic state, $B''_v = 16.9$
cm.$^{-1}$. According to Jeppesen, the values of $B_v$ for the different
vibrational levels associated with the normal state are given by the
relation

$$B_v = 60.8715 - 3.06709\left(v + \tfrac{1}{2}\right) + 0.068393\left(v + \tfrac{1}{2}\right)^2 - 0.0065\left(v + \tfrac{1}{2}\right)^3.$$

Figure 67, taken from the original publication, shows the variation in
$B_v$ with $v$ for three different electronic states of the H2 molecule.

FIG. 67. Plot of $B_v$ against $v$.

It follows from equation (18) and these plots that $r$ increases with
increase in $v$. That is, as the vibrational energy increases, the atoms
tend to vibrate about positions of equilibrium which are increasingly
farther apart, and when dissociation occurs, $B_v$ vanishes and $r$ becomes
infinitely great. In Fig. 66 there is shown a plot of $r$ versus $v$ which
illustrates this conclusion graphically. It will be observed that, while
the value of $r_0$ for $v = 0$ is $0.75 \times 10^{-8}$ cm., this increases with
increase in $v$, and the rate of increase becomes greater as the stage of
dissociation is approached, so that between $v = 13$ and $v = 14$, $r$
increases from 1.49 to $2.00 \times 10^{-8}$ cm., and becomes infinite 13
between $v = 14$ and $v = 15$.
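The connection between $B_v$ and the internuclear distance can be illustrated numerically. The sketch below evaluates Jeppesen's relation for $B_v$, taken here to be in powers of $(v + \tfrac{1}{2})$, and converts the result to a distance on the assumption that $r^2 = 3.32 \times 10^{-15}/B_v$, which is what equation (18) gives with the constants used in the text. The names are illustrative, and the polynomial should not be trusted far outside the range of levels for which it was fitted.

```python
import math


def b_v(v):
    """Jeppesen's relation for B_v (cm^-1), normal state of H2."""
    y = v + 0.5
    return 60.8715 - 3.06709 * y + 0.068393 * y ** 2 - 0.0065 * y ** 3


def r_from_b(b):
    """Internuclear distance (cm), assuming r**2 = 3.32e-15 / B_v."""
    return math.sqrt(3.32e-15 / b)


if __name__ == "__main__":
    for v in range(0, 6):
        print(v, round(b_v(v), 2), r_from_b(b_v(v)))
```

For $v = 0$ this gives $B_v$ close to 59.4 cm$^{-1}$ and $r$ close to $0.75 \times 10^{-8}$ cm., in agreement with the values read from Fig. 66.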

13.4 Orthohydrogen and Parahydrogen. One of the most brilliant 
achievements of quantum mechanics was the deduction by W. Heisen- 
berg 14 and F. Hund 15 that hydrogen must occur in two forms analogous 
to ortho- and para-helium. These conclusions were supported by the 
observations on the relative intensities of rotational lines in the band 
spectrum of H2. D. M. Dennison 16 applied the same suggestion to
interpret the observed decrease in rotational specific heat of hydro- 
gen at very low temperatures. Finally, K. Bonhoeffer and P. Harteck 17 
obtained direct evidence for the existence of the two different forms from 
measurements of the heat conductivity, while A. Eucken and K. Hiller 18 
obtained similar evidence from measurements of the heat capacity. 

Since then a great many investigators have interested themselves 
in the physical properties and chemical reactions of the two forms of 
hydrogen. The results of this work have been summarized by A. Farkas 
in a monograph. 19 The following remarks are based on the discussion 
in Chapter II of this treatise. 20 

As shown in the previous sections the rotational energy states of the
hydrogen molecule are given by the relation

$$E_K = \frac{h^2}{8\pi^2 I}K(K + 1),$$

where $K$ designates the rotational quantum number.

The corresponding eigenfunctions for a rigid rotator, as deduced in 
Chapter VI, are given by the relation 



Like the electron, the proton also possesses a moment of spin which has
the value $\tfrac{1}{2}$ in units of $h/2\pi$, but unlike the electrons, the two
protons in a hydrogen molecule can be either parallel or antiparallel in
direction of spin. This is due to the fact that a completely
antisymmetrical function for the nuclei can be obtained with either
direction of spin, and it is this phenomenon that accounts for the
existence of two modifications of molecular hydrogen.

13 The values of $B_v$ for $v = 13$ and $v = 14$ are taken from the paper by H. Beutler,
loc. cit.

14 Z. Physik, 38, 411 (1926); 41, 239 (1927).

15 Z. Physik, 42, 93 (1927).

16 Proc. Roy. Soc. (London), A115, 483 (1927).

17 Naturwissenschaften, 17, 182 (1929); Z. physik. Chem., B4, 113 (1929).

18 Z. physik. Chem., B4, 142 (1929); also K. Clusius and K. Hiller, ibid., B4, 158
(1929).

19 A. Farkas, "Orthohydrogen, Parahydrogen and Heavy Hydrogen."

20 See also the discussion by the writer in Taylor's "Treatise on Physical
Chemistry," Vol. 2, p. 1372.

Let us consider the effect of interchanging the coordinates of the
two nuclei. Because the two nuclei are identical and symmetrically
located with respect to the center of oscillation, such an interchange has
no effect as regards vibrational motion. On the other hand, in the
rotational eigenfunction, $e^{iK\eta}$ is altered by a rotation through 180°
to $e^{iK(\eta + \pi)}$. If $K$ is even, this leaves the function unaltered, but if
$K$ is odd, the function changes sign. Hence, $e^{iK\eta}$ is a symmetric
function for even values of $K$ and antisymmetric for odd values of $K$.

As in the case of the helium atom and the hydrogen molecule, the
nuclear spins combine to form three symmetric spin functions, for
which $\sum m_s = -1$, 0, and 1, and one antisymmetric function for which
$\sum m_s = 0$. Let $\psi_S$ designate the symmetric spatial function $e^{iK\eta}$ and
$\psi_A$ the corresponding antisymmetric function. Then the complete
eigenfunctions for the molecule are given by the eight functions:

$$(A)\quad \begin{cases} \psi_A\,\alpha(1)\alpha(2), \\ \psi_A\,\dfrac{1}{\sqrt{2}}\{\alpha(1)\beta(2) + \alpha(2)\beta(1)\}, \\ \psi_A\,\beta(1)\beta(2), \\ \psi_S\,\dfrac{1}{\sqrt{2}}\{\alpha(1)\beta(2) - \alpha(2)\beta(1)\}, \end{cases}$$

and

$$(B)\quad \begin{cases} \psi_S\,\alpha(1)\alpha(2), \\ \psi_S\,\dfrac{1}{\sqrt{2}}\{\alpha(1)\beta(2) + \alpha(2)\beta(1)\}, \\ \psi_S\,\beta(1)\beta(2), \\ \psi_A\,\dfrac{1}{\sqrt{2}}\{\alpha(1)\beta(2) - \alpha(2)\beta(1)\}. \end{cases}$$

The group (A) consists of antisymmetrical, (B), of symmetrical
functions. It is possible, on the basis of experimental evidence, to
decide which of these two groups actually represents the behavior of
H2.

Now observations on the intensities of lines in the band spectrum
show that the lines corresponding to transitions between odd values of
$K$ (such as 1 → 1; 3 → 3, etc.) are three times as intense as those
corresponding to transitions between even values of $K$ (such as 0 → 0;
2 → 2, etc.). This observation can be accounted for only by assuming
that the first three functions in group A represent one modification of
H2, designated as orthohydrogen (by analogy with orthohelium), and
that the fourth function in group A represents a second modification, 
designated as parahydrogen. That is, in the latter the nuclear spins 
are antiparallel while in orthohydrogen they are parallel. This is, of 
course, in accordance with the Pauli requirement that the complete 
eigenfunction for an atomic or molecular system must be of the anti- 
symmetrical type. 

Furthermore, the orthohydrogen molecule is a three-fold degenerate 
state. In a magnetic field the three eigenfunctions would correspond 
to three slightly different energy states. Hence, the statistical weight 
of the ortho modification is three times that of the para form; and 
we would expect that ordinary hydrogen should consist of a mixture
of three parts of orthohydrogen and one part of parahydrogen. This, 
however, is observed to hold only at room temperature and higher, 
because of the following considerations. 

At any temperature $T$, the thermodynamical equilibrium between the
two forms is governed by Boltzmann's distribution law. Let $N_0$
denote the total number of molecules. The number in the rotational
state $K$ is given by the relation

$$N_K = N_0\,\frac{p_K\,e^{-E_K/kT}}{\sum_i p_i\,e^{-E_i/kT}},$$

where $p_K$ denotes the statistical weight. Since all para molecules have
even values of $K$, and all ortho molecules, odd values, the ratio of the
two modifications is given by

$$\beta = \frac{N_{para}}{N_{ortho}} = \frac{\sum_{K\ \mathrm{even}} p_K\,e^{-E_K/kT}}{\sum_{K\ \mathrm{odd}} p_K\,e^{-E_K/kT}}.$$

Since $p_K = 2K + 1$ for even values of $K$,

$p_K = 3(2K + 1)$ for odd values of $K$,

and $E_K = \dfrac{h^2}{8\pi^2 I}K(K + 1) = BK(K + 1)$,
it follows that

$$\beta = \frac{1 + 5e^{-6B/kT} + 9e^{-20B/kT} + \ldots}{3\left(3e^{-2B/kT} + 7e^{-12B/kT} + \ldots\right)}.$$

Now $B = 1.19 \times 10^{-14}$ gm. cm.$^2$ sec.$^{-2}$ (see section 13.3), and
$k = 1.371 \times 10^{-16}$ erg deg.$^{-1}$. Hence, $B/k = 86.7$. The values of $\beta$
for different values of $T$, as calculated by A. Farkas, 21 are shown in the
following table.

 T (deg. K)        β         Percentage of parahydrogen

     20         544.8               99.82
     30          32.1               96.98
     40           7.780             88.61
     50           3.327             76.89
     75           1.077             51.86
    100           0.6262            38.51
    150           0.3994            28.54
    210           0.3463            25.72
    273           0.3357            25.13
     ∞            0.3333            25.00
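The equilibrium ratio can be evaluated directly from the two Boltzmann sums. The following sketch is a minimal implementation, assuming statistical weights $2K + 1$ for even $K$ and $3(2K + 1)$ for odd $K$ and energies $BK(K + 1)$; the function names and the cut-off `k_max` are illustrative choices, not taken from the text.

```python
import math


def para_ortho_ratio(temp, b_over_k=86.7, k_max=40):
    """beta = N_para / N_ortho at temp (deg. K); b_over_k = B/k in degrees.

    The sums are truncated at K < k_max, which is ample for temperatures
    up to a few thousand degrees.
    """
    even = sum((2 * k + 1) * math.exp(-b_over_k * k * (k + 1) / temp)
               for k in range(0, k_max, 2))
    odd = sum(3 * (2 * k + 1) * math.exp(-b_over_k * k * (k + 1) / temp)
              for k in range(1, k_max, 2))
    return even / odd


def percent_para(temp, b_over_k=86.7):
    """Percentage of parahydrogen in the equilibrium mixture."""
    beta = para_ortho_ratio(temp, b_over_k)
    return 100.0 * beta / (1.0 + beta)


if __name__ == "__main__":
    for t in (20, 50, 100, 273):
        print(t, round(para_ortho_ratio(t, 85.0), 4), round(percent_para(t, 85.0), 2))
```

The default $B/k = 86.7$ is the value computed in the text. The tabulated entries are reproduced more closely with $B/k$ near 85, a value inferred here from the table itself and not stated in the text.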

On the basis of quantum mechanics it has been shown that no tran- 
sitions (accompanied by emission or absorption of radiation) can occur 
between the two states. Hence, if ordinary hydrogen is cooled down to 
liquid air temperature, the ratio of para to ortho is not changed except 
in the presence of a catalyst such as charcoal. Similarly, if para- 
hydrogen is once prepared pure it does not change spontaneously as the 
temperature is raised. It is the existence of this phenomenon that was 
used by Dennison to interpret the anomalous observations on the heat 
capacity of hydrogen at low temperatures. 

COLLATERAL READING 

The literature on band spectra is quite extensive. The most recent treatise on
this subject is that of H. SPONER, "Molekülspektren und ihre Anwendungen auf
chemische Probleme," two volumes, Julius Springer, Berlin, 1936.

An excellent discussion has also been given by R. DE L. KRONIG, "The Optical
Basis of the Theory of Valency," The Macmillan Co., New York, 1935.

Moreover, the reader will find the topic treated by RUARK and UREY, "Atoms,
Molecules and Quanta," Chapter XII, and by PAULING and WILSON, "Introduction
to Quantum Mechanics," Chapter X.

The papers by C. R. JEPPESEN and H. BEUTLER (see references in chapter) should
also be consulted.

21 Op. cit., p. 14.



CHAPTER XIV 

VALENCE BONDS; ACTIVATION AND 
RESONANCE ENERGY 

14.1 The Homopolar Bond. In the previous chapter we discussed 
two alternative methods for calculating the energy of binding of two 
hydrogen atoms. The HL method, involving the use of atomic orbitals, 
is relatively simple, but leads to only approximate results; the method of 
molecular orbitals is more complicated, but, as shown by the work 
of James and Coolidge, it is possible by the use of these functions to 
obtain very accurate results. 

For many purposes, especially where only approximate results are 
desired, the relation derived by Heitler and London for the energy of 
the shared-electron bond may be expressed conveniently as the sum of 
two terms in the form 

$$E_S = J + K, \qquad (1)$$

where $J$ is the Coulomb energy and $K$ is the so-called "exchange"
energy. This relation is essentially the same as equation (12.32), in
which $E_{11}/(1 + S^2) = J$, and $E_{12}/(1 + S^2) = K$. Since $|K| > |J|$
and both are negative, $E_S$ corresponds to an energy of attraction.

Furthermore, the Pauli principle leads to the conclusion that, in the 
homopolar or shared-electron bond, the two electrons have opposite 
directions of spin. For two atoms with electrons of similar spin mo- 
ment the energy of interaction is given by 

$$E_A = J - K, \qquad (2)$$

and since $E_A$ is positive it must correspond to an energy of repulsion.
Consequently, such electrons have been designated as " antibonding " 
in contrast to the " binding " type exhibited by two electrons of anti- 
parallel spins. 

Although, as stated previously, the concept of exchange energy is an
artificial one, resulting from the mathematical arguments, equation
(1) has nevertheless proved useful in the computation of activation
energies. In that case it is assumed that $E_S$ corresponds to the
observed energy (that is, 100,000 cal., approximately) and that the values
of $J$ and $K$ are in approximately the same ratio as the terms $E_{11}$ and
$E_{12}$ in equation (12.32). Thus Eyring and his associates have assumed
in some of their calculations that $J/K = 1/9$, approximately.

14.2 Interaction of Two H2 Molecules. These considerations are
of special interest in deriving an approximate value for the energy of
interaction of two H2 molecules. 1 At fairly large intermolecular
distances the forces acting between the molecules are of the van der Waals
type, and, as shown in Chapter VIII, there is a weak force of attraction
which varies as $1/r^7$. The experimental fact that H2 condenses to a
liquid only at extremely low temperatures and high pressures shows
that at small intermolecular distances there is a force of repulsion.

As Penney remarks, 

The repulsive forces come about in quite a different way (than the attractive forces). 
In order to form a stable molecule, two hydrogen atoms interact in such a way that 
there is a piling up of charge density between the nuclei. This gives a stable mole- 
cule because both electrons spend an appreciable amount of their time in the region 
where both nuclear fields are large. Thus, if two hydrogen molecules approach one 
another, neither has enough electronic charge density in the region between any 
two protons that come together to counter-balance the repulsive forces, and the 
molecules therefore separate again. A more precise argument in terms of spins 
and energy integrals is as follows. We have seen that in order to form a stable H2 
molecule, the spins of the two electrons must be coupled to give a resultant zero. 
If the spins are coupled to give a resultant unity, a state with statistical weight 
three, strong repulsion occurs. Consider the relative orientations of the spins of two 
electrons located in different H2 molecules. On the average, the two spins will 
behave as if they were coupled in a singlet state ($\sum m_s = 0$) for one-quarter of the
time, and in a triplet state ($\sum m_s = -1, 0, +1$) for the remaining three-quarters of
the time.

Consequently, the energy of interaction $E(\mathrm{H_2})$ of two hydrogen atoms
in different molecules is given by the relation

$$E(\mathrm{H_2}) = \frac{3E_A + E_S}{4} = \frac{3J - 3K + J + K}{4} = J - \frac{K}{2}. \qquad (3)$$

It should be stated that this relation is valid only as long as the
intermolecular distance is at least two or three times the internuclear
distance in the molecule.

By means of the Morse curve which expresses $E_S$ as a function of
internuclear distance (see equation 13.12) it is evidently possible, with
an assumed value for the ratio $J/K$, to calculate $E(\mathrm{H_2})$ as a function
of intermolecular distance.
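The calculation just described may be sketched as follows. The example assumes the Morse constants found for H2 in the previous chapter ($D = 108,900$ cal./mole, $\beta = 2.00$ Å$^{-1}$, $r_0 = 0.74$ Å) and the ratio $J/K = 1/9$ mentioned in section 14.1; the names are illustrative, and the result is meaningful only at distances for which equation (3) is stated to hold.

```python
import math

D_E = 108900.0            # depth of the H2 Morse curve, cal./mole
BETA = 2.00               # reciprocal Angstrom units
R0 = 0.74                 # Angstrom
COULOMB_FRACTION = 0.10   # J / (J + K), equivalent to the assumed J/K = 1/9


def bond_energy(r):
    """E_S from the Morse curve, equation (13.12), in cal./mole."""
    rho = r - R0
    return D_E * (math.exp(-2.0 * BETA * rho) - 2.0 * math.exp(-BETA * rho))


def interaction_energy(r):
    """Equation (3): E(H2) = J - K/2 for two atoms in different molecules."""
    e_s = bond_energy(r)
    j = COULOMB_FRACTION * e_s   # Coulomb part of E_S
    k = e_s - j                  # exchange part of E_S
    return j - 0.5 * k


if __name__ == "__main__":
    for r in (2.0, 2.5, 3.0):
        print(r, round(bond_energy(r)), round(interaction_energy(r)))
```

Since $E_S$ is negative and, with this ratio, $J - K/2 = -0.35\,E_S$, the interaction energy is positive at every distance, that is, the two atoms repel each other.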

1 W. G. Penney, "The Quantum Theory of Valency," p. 21.

14.3 Pauling's Theory of Directed Valence Bonds. Though Heitler
and London showed that, on the basis of their theory of the homopolar
bond, it is possible to account for the observed valences of the elements
of the first two periods, 2 little or no attention was paid in this early
work to the problem of directed valence bonds. The first successful
efforts to treat this topic by the methods of quantum mechanics were
those published by J. C. Slater 3 and, independently, by L. Pauling in a
series of papers published since 1931. 4 Further investigations on the
structures of specific molecules, such as H2O and CH4, have also been
published by J. H. Van Vleck, 5 as well as by W. Heitler, E. Hückel,
and G. Rumer. 6

On the whole, the method used by L. Pauling, especially in his first 
papers on this topic, is more " physical " in the sense that it relies to a 
large extent on a geometrical interpretation of the significance of elec- 
tron eigenfunctions in the formation of bonds. Even though some of 
the conclusions reached by Pauling may not prove to be sufficiently 
well founded, his method of attacking the problem of valence bonds is 
certainly extremely suggestive, and it is largely for this reason that 
the topic has been discussed in the following sections. Slater's method 
is more quantitative, inasmuch as it leads to approximate methods for 
calculating both the direction and energy of bond formation. For this 
reason, however, it presents greater mathematical difficulties, and the 
calculation becomes extremely tedious when dealing with polyatomic 
molecules. While Slater, Pauling, Eyring, and other investigators have 
developed " short-cut " rules by which roughly quantitative results 
can be deduced without too much labor, it is possible, in this chapter, 
to touch only upon the simplest aspects of the methods of solution 
used by the different investigators. 

In his first paper, Pauling introduces the following six postulates 
which are to be used as a guide in the determination of relative energies 
and directions of different bonds in molecule formation. The first three
are merely a restatement of the HL theory: 

2 See discussion by J. H. Van Vleck and A. Sherman, Rev. Modern Phys., 7, 167
(1935), especially pp. 196-197. The abbreviation V.V.S. will be used in
subsequent references. This topic has also been discussed by the author in "Treatise
on Physical Chemistry," by H. S. Taylor, Vol. 2, pp. 1369-1372.

3 J. C. Slater, Phys. Rev., 37, 481; 38, 1109 (1931).

4 L. Pauling, J. Am. Chem. Soc., 53, 1367 (1931).

5 References given by V.V.S.

6 The mathematical technic used by these investigators is quite complex, but
the results obtained are not essentially different from those deduced by Slater and
Pauling.

1. The electron-pair bond is formed through the interaction of an unpaired
electron on each of two atoms.

2. The spins of the electrons are opposed when the bond is formed, so that they 
cannot contribute to the paramagnetic susceptibility of the substance. 

3. Two electrons which form a shared pair cannot take part in forming additional 
pairs. 

To these Pauling adds three more rules " which are justified by the 
qualitative considerations of the factors influencing bond energies." 

4. The main resonance (exchange) terms for a single electron-pair bond are those 
involving only one eigenfunction from each atom.

5. Of two eigenfunctions with the same dependence on r, the one with the larger 
value in the bond direction will give rise to the stronger bond, and for a given eigen- 
function the bond will tend to be formed in the direction with the largest value of 
the eigenfunction. 

6. Of two eigenfunctions with the same dependence on $\theta$ and $\phi$, 7 the one with the
smaller mean value of r, that is, the one corresponding to the lower energy level for 
the atom, will give rise to the stronger bond. 

Let us consider the application of these rules to the determination of
bond directions in such molecules as H2O and NH3. According to the
Lewis-Langmuir theory of valence these molecules are represented as
H:O:H and H:N:H (the third hydrogen atom being attached to the
nitrogen), respectively. The electron in a normal hydrogen atom is in
the 1s ($n = 1$, $l = 0$) state. The electron configurations of N and O in
the normal state are $(2s)^2(2p)^3$ and $(2s)^2(2p)^4$, respectively. The 2s
electrons are paired and therefore do not take part in bond formation
(except when a change occurs in quantization of the electron
eigenfunction owing to bond formation, as will be discussed
subsequently). Hence the electrons which act in bond formation in the
case of N and O are of the type 2p ($n = 2$, $l = 1$).

Now, as Pauling points out, s and p eigenfunctions with the same
value of $n$ "do not differ very much in their mean values of $r$, but their
dependence on $\theta$ and $\phi$ is widely different."

For s eigenfunctions, $\psi_{n0}(r, \theta, \phi) = S_{n0}(r) \cdot s(\theta, \phi)$.

For p eigenfunctions, $\psi_{n1}(r, \theta, \phi) = S_{n1}(r) \cdot p(\theta, \phi)$.

The s eigenfunction is spherically symmetrical, and from Table 2,
Chapter VII, it is seen that the normalized function has the form

$$s(\theta, \phi) = \frac{1}{\sqrt{4\pi}}.$$

7 The symbol $\phi$ will be used in this chapter for the angle $\eta$ used as variable in
Chapter VI and subsequent discussions.

There are three p eigenfunctions:

$$p_\sigma(\theta, \phi) = X_{10}(\theta)Z_0(\phi) = \sqrt{\frac{3}{4\pi}}\cos\theta,$$

$$p_{\pi+}(\theta, \phi) = X_{11}(\theta)Z_1(\phi) = \sqrt{\frac{3}{8\pi}}\sin\theta\,e^{i\phi},$$

$$p_{\pi-}(\theta, \phi) = X_{1,-1}(\theta)Z_{-1}(\phi) = \sqrt{\frac{3}{8\pi}}\sin\theta\,e^{-i\phi}.$$

Since $p_{\pi+}$ and $p_{\pi-}$ are complex, it is desirable to replace them by
real functions which can be represented as functions of the rectangular
coordinates $x$, $y$, and $z$.

Replacing $e^{\pm i\phi}$ by $(\cos\phi \pm i\sin\phi)$, we obtain the real functions:

$$\begin{aligned} p_x &= \sqrt{3}\sin\theta\cos\phi, \\ p_y &= \sqrt{3}\sin\theta\sin\phi, \\ p_z &= \sqrt{3}\cos\theta, \end{aligned} \qquad (4)$$

as compared with the eigenfunction

$$s = 1,$$

where the factor $1/\sqrt{4\pi}$ has been discarded, since we are interested only
in relative magnitudes.
It will be observed that $p_z$ is identical with $p_\sigma$ and that $p_x$ and
$p_y$ are linear combinations of $p_{\pi+}$ and $p_{\pi-}$.

FIG. 68. s and p eigenfunctions (Pauling).

For $\phi = 0$, $p_x = \sqrt{3}\sin\theta$, and $p_y = 0$. Thus, $\sqrt{3}\sin\theta$ represents
a section of the spherical function $p_x$ in the xz-plane, in which it has its
maximum value for any given value of $\theta$. As shown in Fig. 68 this
function is represented by two circles in contact at the origin and each
of diameter $\sqrt{3}$ units, as compared with the s eigenfunction which, on
the same scale, is represented by a circle of unit radius. 8

Thus $|p_x|$ consists of two spheres, with the long axis in the direction
of the x-axis. The similar functions $|p_y|$ and $|p_z|$ have their long
axes in the direction of the y- and z-axes, respectively. In Fig. 38,
Chapter VII, the distribution functions are shown corresponding to these
three p eigenfunctions. "From Rule 5," as Pauling observes, "we
conclude that p-electrons will form stronger bonds than s-electrons and
that the bonds formed by p-electrons in an atom tend to be oriented at right
angles to one another."

Van Vleck and Sherman 9 state the argument for this conclusion as 
follows: 

Let us suppose there is an electron-pair bond between an s electron of some attached atom and the $p\sigma_x$ electron of the central atom. Then the exchange energy associated with this particular pair is greatest if the attached atom lies on the x-axis, since the exchange integrals will clearly be largest in absolute value if the wave functions of the two atoms overlap as much as possible. This requirement clearly demands that the attached atom be located on the axis of the dumb-bell associated with the particular electron of the central atom with which it is paired.

If a second atom is brought up, and if the pairing between $p_x$ and the first attached atom is not broken, then clearly the only possibility is for the second atom to pair with one of the other wave functions, $p_y$ or $p_z$, so that it will become located on the y- or z-axis. Hence in a molecule such as H$_2$O the angle between the two OH axes should be 90°. The experimental value is 106°. The departures from 90° are to be
blamed upon repulsions between the attached atoms and upon sp 2 hybridization. 
Similarly, if the first two atoms have preempted the x and y directions a third atom 
tends to become located on the z-axis, so that in a molecule like NH$_3$ the three NH axes should make angles of 90° with each other. The NH$_3$ molecule is then pyramidal in structure, each axis making an angle of 54.7° with the axis of the NH$_3$ pyramid. The experimental value is 67°, and the discrepancy is to be attributed to the same causes as in H$_2$O.

14.4 Change in Quantization of Bond Eigenfunctions. In the case of a normal carbon atom, with the electron configuration $2s^2\,2p^2$ (known spectroscopically as the $^3P$ state) there are only two unpaired electrons, and this would account for the double bond in C::O. Only about 1.6 v.e. of energy (36,900 cal./mole) is required to excite one of the 2s electrons to a 2p state, and this would give three unpaired p electrons

8 This follows from the following simple consideration:

For any point on the circle, the distance from the origin is given by $r = \sqrt{x^2 + z^2}$. Let D denote the diameter of each circle. Then it follows from the properties of any triangle inscribed in the circle on D as a base, that

$$\frac{r}{D} = \sin\theta.$$

9 V.V.S., op. cit., p. 199.

and one unpaired s electron, thus accounting for the valence of four as in CH$_4$. But, since the s bond is not a specially directed one, this answer cannot be sufficient. A much more satisfactory point of view is the introduction by Pauling of a new concept, that of "hybridization" of eigenfunctions to form combination functions which take the place of the single electron functions. Thus, in the case of the formation of CH$_4$, if the energy of interaction per H atom is greater than the difference in energy of the electron in the 2s and 2p states, then the interaction will cause the electron to be "promoted" and now we must consider each bond as being formed by the grouping together of hydrogen-like s and p eigenfunctions. As Pauling points out, this criterion is satisfied for quadrivalent carbon, and he therefore proceeds to determine the zero-order eigenfunctions "which will form the strongest bonds for the case when the s-p quantization is broken." He assumes that the four bonds are each represented by one of the four combination eigenfunctions:

$$\psi_i = a_i s + b_i p_x + c_i p_y + d_i p_z, \tag{5}$$

where i = 1, 2, 3, or 4, and the coefficients are subject to the orthogonality and normalization requirements$^{10}$

$$\int \psi_i^2\,d\tau = 1 \quad\text{or}\quad a_i^2 + b_i^2 + c_i^2 + d_i^2 = 1, \tag{6}$$

and

$$\int \psi_i\psi_k\,d\tau = 0 \quad\text{or}\quad a_ia_k + b_ib_k + c_ic_k + d_id_k = 0, \tag{7}$$

where $i \neq k$, and i, k = 1, 2, 3, 4.

For a single bond, we can choose the direction arbitrarily. If we take it along the x-axis, for which $p_y = p_z = 0$, the corresponding eigenfunction has the form

$$\psi_1 = a_1 s + b_1 p_x.$$

The maximum value of this function is evidently

$$M = a + \sqrt{3}\,b$$

(where the subscripts may be discarded). Also since $a^2 + b^2 = 1$,

$$M = \sqrt{1 - b^2} + \sqrt{3}\,b.$$

10 This point is discussed in a footnote in the paper by V.V.S., p. 202. Equation (7) is valid if the functions s, $p_x$, $p_y$, and $p_z$ are mutually orthogonal.

Introducing the condition dM/db = 0, it follows that

$$b = \frac{\sqrt{3}}{2}$$

and

$$a = \frac{1}{2}.$$

That is,

$$\psi_1 = \frac{1}{2}\,s + \frac{\sqrt{3}}{2}\,p_x.$$

This has a maximum value M = 2, which is considerably greater than the value 1.732 for a pure p eigenfunction. Figure 69 shows a graph of this function in the xz-plane.
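The maximization just described can be checked numerically. The sketch below is only an illustration: it assumes the normalized form M = a + √3·b with a² + b² = 1 quoted in the text, and the function names are hypothetical helpers, not part of Pauling's treatment.

```python
import math


def max_bond_strength(b):
    """Maximum value M = a + sqrt(3)*b of the s-p hybrid, with a fixed by a^2 + b^2 = 1."""
    a = math.sqrt(1.0 - b * b)
    return a + math.sqrt(3.0) * b


def best_hybrid(steps=100000):
    """Scan b over [0, 1] on a uniform grid and return (b, a, M) at the largest M found."""
    best_b = max((i / steps for i in range(steps + 1)), key=max_bond_strength)
    return best_b, math.sqrt(1.0 - best_b ** 2), max_bond_strength(best_b)


b, a, M = best_hybrid()
print(round(b, 3), round(a, 3), round(M, 3))
```

Within the grid resolution the scan returns b close to √3/2, a close to 1/2, and M close to 2, in agreement with the condition dM/db = 0.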

A second bond function may be introduced in the same plane, of the form

$$\psi_2 = a_2 - b_2\sqrt{3}\,\sin\theta + d_2\sqrt{3}\,\cos\theta.$$

FIG. 69. Tetrahedral sp eigenfunction (Pauling).

The negative sign is due to the fact that for this bond $\cos\phi$ must be equal to −1 as compared to the value +1 for $\psi_1$ in the xz-plane.

Applying the condition for normalization and the additional requirement

$$a_1a_2 + b_1b_2 = 0,$$

it is readily shown that

$$\psi_2 = b_2\sqrt{3}\,(1 - \sin\theta) + \sqrt{1 - 4b_2^2}\,\sqrt{3}\,\cos\theta.$$

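This will have a maximum value M = 2 for definite values of $\theta$ and $b_2$, such that $\partial\psi_2/\partial b_2 = 0$ and $\partial\psi_2/\partial\theta = 0$. From these conditions it follows that

$$\sin\theta\cos\phi = -\tfrac{1}{3}; \qquad \phi = 180°, \quad \theta = 19°28'.$$

That is, the second bond eigenfunction makes an angle of 109°28' with the first, "which is just the angle between the lines drawn from the center to two corners of a regular tetrahedron." The actual expression for the bond function has the form

$$\psi_2 = \frac{1}{2}\,s - \frac{1}{2\sqrt{3}}\,p_x + \frac{\sqrt{2}}{\sqrt{3}}\,p_z.$$

By similar methods it may be shown that the third and fourth eigenfunctions are

$$\psi_3 = \frac{1}{2}\,s - \frac{1}{2\sqrt{3}}\,p_x + \frac{1}{\sqrt{2}}\,p_y - \frac{1}{\sqrt{6}}\,p_z$$

and

$$\psi_4 = \frac{1}{2}\,s - \frac{1}{2\sqrt{3}}\,p_x - \frac{1}{\sqrt{2}}\,p_y - \frac{1}{\sqrt{6}}\,p_z,$$

which are directed toward the other two corners of the tetrahedron.

Pauling also points out that an equivalent set of four tetrahedral eigenfunctions is

$$\begin{aligned}
\psi_{111} &= \tfrac{1}{2}(s + p_x + p_y + p_z),\\
\psi_{1\bar{1}\bar{1}} &= \tfrac{1}{2}(s + p_x - p_y - p_z),\\
\psi_{\bar{1}1\bar{1}} &= \tfrac{1}{2}(s - p_x + p_y - p_z),\\
\psi_{\bar{1}\bar{1}1} &= \tfrac{1}{2}(s - p_x - p_y + p_z),
\end{aligned}$$

which differs from the previous set by a rotation of the atom as a whole.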
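As a check on the requirements (6) and (7), the coefficients of both tetrahedral sets can be tested for orthonormality, and the angle between bond directions computed. This is a minimal sketch; the coefficient tables are the standard forms of Pauling's tetrahedral functions as I understand them, and the helper names are hypothetical.

```python
import math

S2, S3, S6 = math.sqrt(2), math.sqrt(3), math.sqrt(6)

# Coefficients (a, b, c, d) of s, p_x, p_y, p_z for each hybrid.
FIRST_SET = [
    (0.5, S3 / 2, 0.0, 0.0),
    (0.5, -1 / (2 * S3), 0.0, S2 / S3),
    (0.5, -1 / (2 * S3), 1 / S2, -1 / S6),
    (0.5, -1 / (2 * S3), -1 / S2, -1 / S6),
]
SECOND_SET = [
    (0.5, 0.5, 0.5, 0.5),
    (0.5, 0.5, -0.5, -0.5),
    (0.5, -0.5, 0.5, -0.5),
    (0.5, -0.5, -0.5, 0.5),
]


def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def is_orthonormal(hybrids, tol=1e-12):
    """True if the coefficient vectors satisfy equations (6) and (7)."""
    return all(
        abs(dot(u, v) - (1.0 if i == k else 0.0)) < tol
        for i, u in enumerate(hybrids)
        for k, v in enumerate(hybrids)
    )


def bond_angle_deg(u, v):
    """Angle between the bond directions (b, c, d) of two hybrids, in degrees."""
    pu, pv = u[1:], v[1:]
    cos_t = dot(pu, pv) / math.sqrt(dot(pu, pu) * dot(pv, pv))
    return math.degrees(math.acos(cos_t))


print(is_orthonormal(FIRST_SET), is_orthonormal(SECOND_SET))
print(round(bond_angle_deg(FIRST_SET[0], FIRST_SET[1]), 2))
```

The bond direction is taken along the vector (b, c, d) because the p part of each function is largest in that direction; every pair in either set then gives the tetrahedral angle.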

" This calculation," as Pauling remarks, " provides the quantum 
mechanical justification of the chemist's tetrahedral carbon atom, present in diamond and all aliphatic carbon compounds," as well as for a number of other tetrahedral atoms and ions. Furthermore, since "each of these tetrahedral bond eigenfunctions is cylindrically symmetrical about its bond direction, the bond energy is independent of orientation about this direction, so that there will be free rotation about a bond." On the other hand, there can be no free rotation about a double bond.

The reader will be well repaid by studying further Pauling's original 
paper upon which the discussion has been based, as well as his subse- 
quent papers. There is a wealth of material there which is of great 
significance for the quantum mechanical interpretation of directed 
valence bonds. 

14.5 Slater's Treatment of Polyatomic Molecules.$^{11}$ The problem
which Slater has attacked is that of calculating the electronic energy 
states of molecules from a consideration of the interaction of the atomic 
orbitals. The calculation is essentially an extension to three or more 
atoms of the method of secular equations discussed in section 12.5. 
Starting with one-electron wave functions which involve coordinates 
both of position and spin, zero-order wave functions are built up which 
are antisymmetric in the electrons (in accordance with Pauli's principle). 

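11 J. C. Slater, Phys. Rev., 38, 1109 (1931).

Let $\psi_1, \psi_2, \ldots, \psi_i$ denote such combinations. If H is the Hamiltonian operator of the problem, then the secular equation to be solved for the energy is of the form

$$\begin{vmatrix}
H_{11} - E d_{11} & H_{12} - E d_{12} & \cdots & H_{1i} - E d_{1i}\\
H_{21} - E d_{21} & H_{22} - E d_{22} & \cdots & H_{2i} - E d_{2i}\\
\vdots & \vdots & & \vdots\\
H_{i1} - E d_{i1} & H_{i2} - E d_{i2} & \cdots & H_{ii} - E d_{ii}
\end{vmatrix} = 0, \tag{12}$$

where

$$H_{ij} = \int \psi_i H \psi_j\,d\tau, \qquad d_{ij} = \int \psi_i\psi_j\,d\tau, \tag{13}$$

and $E_1, E_2, \ldots, E_i$ represent the roots of the equation.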

The degree of the secular equation will not exceed AT 2 , where N is the 
total number of electrons, although it is usually possible to separate it 
into a number of equations of lower degree. 
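For the smallest non-trivial case, two functions, the secular equation (12) is a quadratic in E and can be solved directly. The sketch below assumes real, symmetric matrix elements; the numerical values are illustrative only and are not taken from Slater's paper.

```python
import math


def secular_roots_2x2(H11, H12, H22, d11=1.0, d12=0.0, d22=1.0):
    """Roots E of det[[H11 - E*d11, H12 - E*d12], [H12 - E*d12, H22 - E*d22]] = 0."""
    # Expanding the determinant gives A*E^2 + B*E + C = 0.
    A = d11 * d22 - d12 ** 2
    B = -(H11 * d22 + H22 * d11 - 2.0 * H12 * d12)
    C = H11 * H22 - H12 ** 2
    disc = math.sqrt(B * B - 4.0 * A * C)
    return sorted(((-B - disc) / (2.0 * A), (-B + disc) / (2.0 * A)))


# Two equivalent functions with overlap d12 = 0.2 (arbitrary energy units).
print(secular_roots_2x2(-1.0, -0.5, -1.0, d12=0.2))
```

With H11 = H22 the two roots reduce to (H11 + H12)/(1 + d12) and (H11 − H12)/(1 − d12), the same structure as the symmetric and antisymmetric Heitler-London energies.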

Instead of attempting to review Slater's rather lengthy paper in terms 
of general statements, it will be more instructive to consider in detail 
the method used in the solution of one particular problem. Moreover, 
the problem we shall discuss is of importance in connection with the 
treatment by H. Eyring and M. Polanyi of the topic of activation 
energy. 

Let us consider the interaction of three atoms, each containing a single 
electron in the s state. We can assign a one-electron wave function to 
each atom and we shall denote these by A, B, and C. Each of these is 
a function of three coordinates of position, and one of spin. Let the 
coordinates (of position and spin) of the first electron be denoted by 1, 
of the second by 2, etc. An approximate function which might be a solu- 
tion for the unperturbed state of the system is the product of the 
individual one-electron functions of the form A(1) B(2) C(3) or A(2) B(1) C(3). However, there is one combination of all the different possible permutations of the functions which is the only function that is antisymmetric in the electrons. This has the form

$$\psi = \frac{1}{\sqrt{3!}}
\begin{vmatrix}
A(1) & A(2) & A(3)\\
B(1) & B(2) & B(3)\\
C(1) & C(2) & C(3)
\end{vmatrix}. \tag{14}$$

If there were no electron spin, this would be the only antisymmetric eigenfunction, and there would be only one corresponding eigenvalue. However, in the present case we must take into account the fact that each electron may have either positive or negative spin. Hence, there will actually be $2^3 = 8$ different possible eigenfunctions and 8 corresponding eigenvalues. These states are indicated in the following table by Roman numerals, where a designates the part of the function A depending on the coordinates of position, and similarly b and c, while the two spin functions are indicated by + and −. The last column gives the total spin.

State    Spin of a    Spin of b    Spin of c    Total spin
I            +            +            +           +3/2
II           +            +            -           +1/2
III          +            -            +           +1/2
IV           -            +            +           +1/2
V            +            -            -           -1/2
VI           -            +            -           -1/2
VII          -            -            +           -1/2
VIII         -            -            -           -3/2

" States I and VIII give," as Slater points out, " two of the four 
states of the quartet. II, III, IV give a cubic, one of whose roots gives 
another state of the quartet, and the other two roots give the two 
doublets. Similarly, V, VI, VII yield the fourth state of the quartet 
and the other two states of the doublet." 
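The count of spin states and their grouping by total spin can be reproduced by direct enumeration. This is a small illustrative sketch: the function names are hypothetical, and the order in which `itertools.product` yields the states need not coincide with the Roman numerals used in the text.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def spin_states(n_electrons=3):
    """Return a list of (signs, total_spin) for every assignment of +1/2 or -1/2."""
    half = Fraction(1, 2)
    states = []
    for signs in product((+1, -1), repeat=n_electrons):
        states.append((signs, sum(s * half for s in signs)))
    return states


def count_by_total_spin(n_electrons=3):
    """Number of states for each value of the total spin component."""
    return Counter(total for _, total in spin_states(n_electrons))


for signs, total in spin_states():
    print(signs, total)
print(count_by_total_spin())
```

For three electrons there is one state each with total spin +3/2 and −3/2, and three each with +1/2 and −1/2, which is the grouping behind the two single rows and the two cubic equations mentioned by Slater.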

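The antisymmetric eigenfunction corresponding to state I is evidently of the form

$$\psi_{I} = \frac{1}{\sqrt{3!}}
\begin{vmatrix}
a_1\alpha_1 & a_2\alpha_2 & a_3\alpha_3\\
b_1\alpha_1 & b_2\alpha_2 & b_3\alpha_3\\
c_1\alpha_1 & c_2\alpha_2 & c_3\alpha_3
\end{vmatrix}, \tag{15}$$

where $a_1$ designates the part of the function $A_1$ depending on the coordinates and $\alpha_1$ the part which involves the spin. We shall use $\alpha$ and $\beta$ to correspond to + and − directions of spin, respectively. Similarly, the functions for states II and V are given by

$$\psi_{II} = \frac{1}{\sqrt{3!}}
\begin{vmatrix}
a_1\alpha_1 & a_2\alpha_2 & a_3\alpha_3\\
b_1\alpha_1 & b_2\alpha_2 & b_3\alpha_3\\
c_1\beta_1 & c_2\beta_2 & c_3\beta_3
\end{vmatrix} \tag{16}$$

and

$$\psi_{V} = \frac{1}{\sqrt{3!}}
\begin{vmatrix}
a_1\alpha_1 & a_2\alpha_2 & a_3\alpha_3\\
b_1\beta_1 & b_2\beta_2 & b_3\beta_3\\
c_1\beta_1 & c_2\beta_2 & c_3\beta_3
\end{vmatrix}. \tag{17}$$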



The eigenfunctions for the other states can be written down in a similar manner.

We now have to consider the form which each matrix element will assume in the secular equation for calculating the energy states. The energy operator for a molecule with three fixed nuclei and three electrons has the form

$$H = -\frac{h^2}{8\pi^2 m}\sum_i \nabla_i^2 - \sum_i\sum_\alpha \frac{Z_\alpha e^2}{r_{i\alpha}} + \sum_{i<j}\frac{e^2}{r_{ij}} + \sum_{\alpha<\beta}\frac{Z_\alpha Z_\beta e^2}{r_{\alpha\beta}}, \tag{18}$$

where the summations over i and j refer to the electrons, and those over $\alpha$ and $\beta$ refer to the nuclei. The Z's represent the nuclear charges, and the r's the distances of separation.

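We have in addition the fact that each function a, b, or c satisfies a one-electron S. equation

$$-\frac{h^2}{8\pi^2 m}\nabla^2 a + V_a\,a = E_a\,a,$$

where $V_a$ is the potential function for the one-electron problem and $E_a$ the energy value. If we substitute this into the operator H, we obtain the relation

$$H(a_1b_2c_3) = \Big\{E_a + E_b + E_c - V_a(1) - V_b(2) - V_c(3) - \sum_i\sum_\alpha\frac{Z_\alpha e^2}{r_{i\alpha}} + \sum_{i<j}\frac{e^2}{r_{ij}} + \sum_{\alpha<\beta}\frac{Z_\alpha Z_\beta e^2}{r_{\alpha\beta}}\Big\}(a_1b_2c_3). \tag{19}$$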

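The matrix elements which must be formed from the functions $\psi$ are of two types:

(1) those of the form $H_{I,I}$ or $d_{I,I}$,
and
(2) those of the form $H_{I,II}$ or $d_{I,II}$,
where

$$H_{I,I} = \iint \psi_I H \psi_I\,(d\tau)^3(d\omega)^3, \qquad d_{I,I} = \iint \psi_I\psi_I\,(d\tau)^3(d\omega)^3,$$

and similarly for $H_{I,II}$ and $d_{I,II}$. The element of volume in coordinate space is designated by $d\tau$, and the element in spin space by $d\omega$.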



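Let us consider first the integral of the second type

$$H_{I,II} = \frac{1}{3!}\iint \Big\{\sum \pm P(a_1\alpha_1\,b_2\alpha_2\,c_3\alpha_3)\Big\}\,H\,\Big\{\sum \pm P(a_1\alpha_1\,b_2\alpha_2\,c_3\beta_3)\Big\}\,(d\tau)^3(d\omega)^3,$$

where the symbol P denotes a permutation of the functions a, b, c over the three electrons, and the ± sign is used, depending on whether the permutation is even or odd. Since H is symmetric in the electrons, it follows that each term of the first summation yields the same result, and since there are just 3! = 6 terms, we can write

$$H_{I,II} = \iint (a_1\alpha_1\,b_2\alpha_2\,c_3\alpha_3)\,H\,\Big\{\sum \pm P(a_1\alpha_1\,b_2\alpha_2\,c_3\beta_3)\Big\}\,(d\tau)^3(d\omega)^3.$$

There are six terms in this summation. One of the terms, taken at random, has the form

$$I = \iint (a_1\alpha_1\,b_2\alpha_2\,c_3\alpha_3)\,H\,(b_1\alpha_1\,c_2\beta_2\,a_3\alpha_3)\,(d\tau)^3(d\omega)^3.$$

If we assume that there is no interaction between the spin part of the wave function and the coordinate part, then we can write this in the form

$$I = \int (a_1b_2c_3)\,H\,(b_1c_2a_3)\,(d\tau)^3 \int \alpha_1^2\,d\omega \int \alpha_2\beta_2\,d\omega \int \alpha_3^2\,d\omega.$$

Now

$$\int \alpha_i^2\,d\omega = \int \beta_i^2\,d\omega = 1, \quad\text{where } i = 1, 2, \text{ or } 3,$$

while

$$\int \alpha_i\beta_i\,d\omega = 0.$$

Hence I = 0, and for the same reason, every term in $H_{I,II}$ must be equal to zero. Also it follows that $d_{I,II} = 0$. This conclusion may be stated in the more general form, that matrix elements involving eigenfunctions belonging to different values of the total spin are equal to zero.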

We shall now introduce a convenient notation used by Slater. Instead of using the subscripts 1, 2, or 3, he indicates these by the order in which the functions a, b, and c are written. Furthermore, the integral sign and element of volume are omitted, so that we have

$$\int (a_1b_2c_3)\,H\,(b_1c_2a_3)\,(d\tau)^3 = (abc/H/bca),$$

and

$$\int (a_1b_2c_3)\,(b_1c_2a_3)\,(d\tau)^3 = (abc/1/bca).$$

The corresponding integral over both $d\tau$ and $d\omega$ will evidently vanish identically unless a, b, and c have the same spin.

We can now proceed to calculate the different matrix elements in the secular equation for the problem under discussion. Let us consider the element

$$H_{I,I} = \int (abc)\,H\,\Big\{\sum \pm P(abc)\Big\}\,(d\tau)^3.$$

There are six terms, of which three are positive and three negative, as is readily seen from an inspection of the expression in equation (15) for $\psi_I$. These terms are as follows:

Positive    Negative
abc         cba
bca         acb
cab         bac

Therefore,

$$H_{I,I} = (abc/H/abc) + (abc/H/bca) + (abc/H/cab) - (abc/H/cba) - (abc/H/acb) - (abc/H/bac), \tag{20}$$

and

$$d_{I,I} = (abc/1/abc) + (abc/1/bca) + (abc/1/cab) - (abc/1/cba) - (abc/1/acb) - (abc/1/bac). \tag{21}$$
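The signs attached to the six permutations follow from the parity of each permutation, which can be verified by counting inversions. The following is a minimal sketch with hypothetical helper names.

```python
from itertools import permutations


def parity(perm, reference="abc"):
    """Return +1 for an even permutation of `reference`, -1 for an odd one."""
    idx = [reference.index(ch) for ch in perm]
    inversions = sum(
        1
        for i in range(len(idx))
        for j in range(i + 1, len(idx))
        if idx[i] > idx[j]
    )
    return -1 if inversions % 2 else 1


def signed_terms(reference="abc"):
    """Map each permutation of the functions to the sign it carries in the determinant."""
    return {"".join(p): parity(p, reference) for p in permutations(reference)}


for term, sign in sorted(signed_terms().items()):
    print(term, "+" if sign > 0 else "-")
```

The cyclic permutations abc, bca, cab come out even and the single interchanges acb, bac, cba odd, matching the signs in equations (20) and (21).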

Since $H_{I,II} = H_{I,III}$, etc. = 0, and similarly for $d_{I,II}$, etc., it follows that the first row in the secular determinant is

$$H_{I,I} - d_{I,I}E_1 = 0,$$

while the eighth row is

$$H_{VIII,VIII} - d_{VIII,VIII}E_1 = 0,$$

where $H_{I,I} = H_{VIII,VIII}$, and $d_{I,I} = d_{VIII,VIII}$. Consequently, $E_1$ gives the energy of two of the four states of the "quartet." That is,

$$E_1 = \frac{H_{I,I}}{d_{I,I}}.$$

It is of interest to consider the significance of the various terms in $H_{I,I}$ and $d_{I,I}$. Evidently,

$$(abc/1/abc) = 1.$$

Also (abc/H/abc) is a Coulomb integral, since [according to equation (19)] it represents the sum of Coulomb interactions between all pairs of electrons in the three atoms.

A term such as (abc/H/acb) in which two of the functions are interchanged represents an exchange integral (in this case between b and c) such as Heitler and London found in the interaction of two hydrogen atoms. If the functions on the two sides of the operator H differ by a cyclic permutation of three electrons, as in (abc/H/cab), the resulting integral is zero, if the functions are orthogonal. But even if the functions are not quite orthogonal, integrals of this type have much smaller numerical values than the simple exchange integrals and may therefore be neglected in a first approximation.

Thus, if we consider the case in which one atom, say c, is at a distance from the other two atoms, so that the only permutations that count are those between a and b,

$$E_1 = \frac{(ab/H/ab) - (ab/H/ba)}{1 - (ab/1/ba)},$$

which is the energy corresponding to the antisymmetrical eigenfunction $\psi_A$ in the Heitler-London theory. [See equation (12.35).]

Passing to the consideration of the other matrix elements, we obtain the result

$$H_{II,II} = (abc/H/abc) - (abc/H/bac). \tag{23}$$

The absence of other terms is due to the fact that we will have non-vanishing terms only in those cases in which a and b (functions with similar spin) are interchanged. All terms in which a and c or b and c are interchanged will vanish. By similar arguments it may be shown that

$$\begin{aligned}
H_{III,III} &= (abc/H/abc) - (abc/H/cba),\\
H_{IV,IV} &= (abc/H/abc) - (abc/H/acb),\\
H_{II,III} &= (abc/H/cab) - (abc/H/acb),\\
H_{III,IV} &= (abc/H/cab) - (abc/H/bac),\\
H_{II,IV} &= (abc/H/cab) - (abc/H/cba).
\end{aligned} \tag{24}$$

As Slater points out, "The matrices of unity are just the same with 1 substituted for H. In the last formula, we have used the fact that (abc/H/bca) = (abc/H/cab). This follows from the following two steps: (abc/H/bca) = (cab/H/abc), since from definition we can make any permutation of the first set of indices, if only we make an identical permutation of the second set at the same time; and (cab/H/abc) = (abc/H/cab), since the matrices are Hermitian and real."$^{12}$
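The resulting cubic equation is of the form

$$\begin{vmatrix}
H_{II,II} - d_{II,II}E & H_{II,III} - d_{II,III}E & H_{II,IV} - d_{II,IV}E\\
H_{III,II} - d_{III,II}E & H_{III,III} - d_{III,III}E & H_{III,IV} - d_{III,IV}E\\
H_{IV,II} - d_{IV,II}E & H_{IV,III} - d_{IV,III}E & H_{IV,IV} - d_{IV,IV}E
\end{vmatrix} = 0. \tag{25}$$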

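Instead of attempting to solve this equation, Slater adopts a method which has been used successfully by other investigators subsequently. Such linear combinations of the functions $\psi_{II}$, $\psi_{III}$, and $\psi_{IV}$ are chosen as will transform the determinant into the form

$$\begin{vmatrix}
H_{AA} - d_{AA}E & H_{AB} - d_{AB}E & 0\\
H_{AB} - d_{AB}E & H_{BB} - d_{BB}E & 0\\
0 & 0 & H_{DD} - d_{DD}E
\end{vmatrix} = 0. \tag{26}$$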

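In this particular case the new functions A, B, and D are defined as follows:

$$A = \frac{1}{\sqrt{2}}(\psi_{II} - \psi_{III}), \qquad
B = \frac{1}{\sqrt{2}}(\psi_{III} - \psi_{IV}), \qquad
D = \frac{1}{\sqrt{3}}(\psi_{II} + \psi_{III} + \psi_{IV}). \tag{27}$$

Slater also introduces a fourth function

$$C = \frac{1}{\sqrt{2}}(\psi_{IV} - \psi_{II}). \tag{28}$$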

It is evident that these four functions are not linearly independent, 
since A + B + C = 0. The advantage inherent in the transforma- 
tion arises from the fact that each of the functions A, B, and C repre- 

12 The italics have been introduced by the writer. The reader will find it easier, in the beginning, to verify the relations given for the different matrix elements by direct expansion of the products of two determinants, such as those for $\psi_{II}$ and $\psi_{III}$. The main point is that, wherever a product such as $\alpha_i\beta_i$ occurs, the corresponding term vanishes. This accounts for the fact that only two terms occur in each of the matrix elements, in the group of equations (24). It will be recognized that these terms are of either the Coulomb or exchange type.

sents a bond eigenfunction. Thus A corresponds to the interaction of atoms b and c, B to that of a and b, and C to that of c and a. The total spin in each of these cases is 1/2, while in that of D, the total spin is 3/2. Therefore, if the determinant is set up in terms of D and any two of the functions A, B, and C, it must have the form indicated in (26), since, as deduced already, matrix elements between functions corresponding to different values of the spin vanish. The result is that the determinant may be factored into a quadratic and a linear equation. The latter gives the energy of the other term of the quartet, $E = H_{DD}/d_{DD}$, while the quadratic yields the energy values of the two doublet levels.

Since A + B + C = 0, we obtain the result

$$\begin{aligned}
H_{AA} + H_{AB} + H_{AC} &= 0,\\
H_{AB} + H_{BB} + H_{BC} &= 0,\\
H_{AC} + H_{BC} + H_{CC} &= 0.
\end{aligned}$$

By eliminating $H_{AC}$ and $H_{BC}$ from these equations, we derive the relation

$$H_{AB} = \tfrac{1}{2}(H_{CC} - H_{AA} - H_{BB}),$$

which enables us to express the non-diagonal terms in (26) in terms of the diagonal elements. The latter are readily calculated. Thus,

$$\begin{aligned}
H_{AA} &= \tfrac{1}{2}\int(\psi_{II} - \psi_{III})\,H\,(\psi_{II} - \psi_{III})\\
&= (abc/H/abc) + (abc/H/acb) - (abc/H/cab) - \tfrac{1}{2}\{(abc/H/bac) + (abc/H/cba)\}.
\end{aligned}$$

In a similar manner we can write down the expressions for $H_{BB}$ and $H_{CC}$ and, consequently, for $H_{AB}$. If we assume that the functions II, III, IV are approximately orthogonal, then we have the relations $d_{AA} = d_{BB} = d_{CC} = 1$, and the resulting expression for the energy of the doublets is given by

$$\begin{aligned}
E = {}& (abc/H/abc) - (abc/H/cab) \pm \big[(abc/H/acb)^2 + (abc/H/bac)^2 + (abc/H/cba)^2\\
& - (abc/H/acb)(abc/H/bac) - (abc/H/acb)(abc/H/cba) - (abc/H/bac)(abc/H/cba)\big]^{1/2}.
\end{aligned} \tag{29}$$

In order to analyze the physical significance of this result, let us consider the three univalent atoms a, b, and c (each having a single s electron), arranged at the corners of a triangle. Let $r_{ab}$, etc., designate the distances between each pair of atoms; and let $\epsilon_{ab}$, etc., designate the total energies of binding of the corresponding pair of atoms. Using equations (1) and (2), we can express these energies in the form

$$\epsilon_{ab} = C_{ab} \pm \gamma, \qquad \epsilon_{bc} = C_{bc} \pm \alpha, \qquad \epsilon_{ca} = C_{ca} \pm \beta,$$

where $C_{ab}$ corresponds to the Coulomb energy and $\gamma$ to the exchange energy, with similar interpretations for the other symbols.

Each term, such as $C_{ab}$ or $\gamma$, is a function of the internuclear distance $r_{ab}$, as long as the atom c is infinitely removed. We may express the energy in the form

$$\epsilon_{ab} = (ab/H/ab) \pm (ab/H/ba).$$

Without changing the values of the terms, we may also write this in the form

$$\epsilon^{c}_{ab} = (abc/H/abc) \pm (abc/H/bac),$$

where the presence of atom c at a considerable distance from a and b is indicated formally.

In a similar manner we may express the binding energies $\epsilon^{a}_{bc}$ and $\epsilon^{b}_{ca}$. But now let us consider what happens to the value of $\epsilon^{c}_{ab}$ when c is nearer to a and b. In that case, the energy of the system is represented by E in equation (29). In the latter, the first term

$$(abc/H/abc) = C_{ab} + C_{bc} + C_{ca}.$$

The second term in equation (29), (abc/H/cab), which is due to the cyclic permutation of the three electrons, is known as a multiple exchange integral, and its actual value is small compared to the single exchange integrals, such as (abc/H/bac). Hence, it may be neglected for most purposes.

Evidently $(abc/H/acb) = \alpha$, since it corresponds to the exchange energy for the molecule bc with a completely removed. Similarly, we have the relations

$$(abc/H/bac) = \gamma,$$

and

$$(abc/H/cba) = \beta.$$

Consequently, we can write equation (29) in the simplified form

$$E_{abc} = C_{ab} + C_{bc} + C_{ca} \pm (\alpha^2 + \beta^2 + \gamma^2 - \alpha\beta - \beta\gamma - \gamma\alpha)^{1/2}. \tag{30}$$

As shown in the discussion of the HL theory, both the Coulomb and exchange energy terms are negative for all ranges of values of internuclear distance of practical interest. Hence, $C_{ab} + \gamma$ corresponds to attraction, and since $|\gamma| > |C_{ab}|$, $C_{ab} - \gamma$ corresponds to repulsion. For a similar reason, the positive sign in equation (30) must be used to indicate attraction, and the negative sign for repulsion.
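The three-atom expression can be explored numerically. The sketch below assumes the form E = C_ab + C_bc + C_ca ± (α² + β² + γ² − αβ − βγ − γα)^{1/2}, returns both branches without deciding which sign convention is attached to "attraction," and checks the limiting case in which atom c is removed to infinity (α = β = 0). The function names and the numerical values are illustrative assumptions only.

```python
import math


def three_atom_levels(c_sum, alpha, beta, gamma):
    """Return (lower, upper) doublet energies for total Coulomb energy c_sum."""
    radicand = (
        alpha ** 2 + beta ** 2 + gamma ** 2
        - alpha * beta - beta * gamma - gamma * alpha
    )
    root = math.sqrt(max(radicand, 0.0))  # guard against tiny negative rounding
    return c_sum - root, c_sum + root


def symmetric_radicand(alpha, beta, gamma):
    """Same radicand written as half the sum of squared differences."""
    return 0.5 * ((alpha - beta) ** 2 + (beta - gamma) ** 2 + (gamma - alpha) ** 2)


# Atom c far away: only the a-b exchange energy gamma survives.
print(three_atom_levels(-0.5, 0.0, 0.0, -3.0))
```

In the limiting case the two branches reduce to C_ab + γ and C_ab − γ, the two-atom energies of the Heitler-London treatment. The second helper shows that the radicand is always non-negative, since it is a sum of squares.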

14.6 Method of Localized Pairs. The procedure described in the 
previous section is obviously quite tedious. For calculating the shapes 
of molecules and activation energies Pauling and Van Vleck have 
developed a modification of Slater's method which is more convenient 
to use. Although results obtained in this manner are only very rough 
approximations, nevertheless they are of sufficient significance for the 
purpose of physical interpretation. 

This method has been designated as that of localized pairs by 
W. G. Penney, and as that of valence-bond wave functions by Pauling 
and Wilson. The method is essentially similar to that described in a 
previous section in connection with Pauling's treatment of directed 
valence bonds. Van Vleck has also dealt along the same lines with the 
problem of the structure of CH$_4$ and H$_2$O.$^{13}$

As an illustration of the application of the method of localized pairs, 
let us consider the interaction of an atom C with the molecule AB, the
problem which has been discussed in the previous section. We shall 
assume that the atoms A and B are bound by a shared-electron bond 
(two s electrons of opposite spins) and that C contains a single s elec- 
tron in the valence shell. 

Let us now bring up atom C to the molecule AB in a line at right angles to the axis of the molecule. Then each atom in the molecule interacts with the atom C, and the magnitude of this interaction energy is given, according to equation (3), by the relation

$$E_{ca} + E_{cb} = C_{ca} - \frac{\beta}{2} + C_{cb} - \frac{\alpha}{2}.$$

Hence, the total energy of the system (in the lowest state) is given by

$$E^{c}_{ab} = C_{ab} + C_{bc} + C_{ca} + \gamma - \frac{\alpha + \beta}{2}. \tag{31}$$

In a similar manner we obtain, for the interaction of the atom A with the molecule BC, the total energy

$$E^{a}_{bc} = C_{ab} + C_{bc} + C_{ca} + \alpha - \frac{\beta + \gamma}{2}, \tag{32}$$
13 For details see W. G. Penney, op. cit., Chapter IV, and J. H. Van Vleck and A. Sherman, loc. cit.

and in the case of atom B and molecule AC,

$$E^{b}_{ac} = C_{ab} + C_{bc} + C_{ca} + \beta - \frac{\alpha + \gamma}{2}. \tag{33}$$

These three relations obviously represent the limiting cases for the 
energy of a system of three atoms arranged at the corners of a triangle, 
with the three interatomic distances approximately the same. We are 
interested in deducing from equations (31), (32), and (33) the energy 
of this latter system. 

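Now in equation (31) the terms $\alpha$ and $\beta$ are small compared to $\gamma$, and therefore we can write this relation in the form

$$E^{c}_{ab} = \sum C - \Big[\gamma^2 - \gamma(\alpha + \beta) + \frac{\alpha^2 + \beta^2}{4} + \frac{\alpha\beta}{2}\Big]^{1/2} \approx \sum C - \big[\gamma^2 - \alpha\gamma - \beta\gamma\big]^{1/2}, \tag{34}$$

where $\sum C = C_{ab} + C_{bc} + C_{ca}$, and the right-hand side is derived from the expression in the center by omitting the terms $(\alpha^2 + \beta^2)/4$ and $\alpha\beta/2$, under the radical sign.

For $E^{a}_{bc}$ and $E^{b}_{ac}$ similar relations can be deduced, and since $E_{abc}$ must involve the exchange energy terms symmetrically, we obtain the relation

$$E_{abc} = \sum C - \big[\tfrac{1}{2}\{(\alpha - \beta)^2 + (\beta - \gamma)^2 + (\gamma - \alpha)^2\}\big]^{1/2}, \tag{35}$$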

which is identical with equation (30), except for the sign.$^{14}$ The ± sign in the latter arose from taking into account the spin degeneracy. The negative sign is used in equation (35) to indicate that we must use the negative values of the square root, since this will yield the lowest (most stable) energy state.

FIG. 70. Interaction energies of four atoms.

In a similar manner we may deduce the potential energy of a system of four univalent atoms with only spin degeneracy. In Fig. 70 let A, B, C, D denote the four atoms, arranged so that the interatomic distances are approximately the same. From these four atoms it is evidently possible to have the following three pairs of diatomic molecules:

AB    AD    AC
CD    BC    BD.

Let $\alpha$, $\beta$, $\gamma$, $\delta$, $\epsilon$, and $\zeta$ designate the exchange energies for these molecules as indicated in Fig. 70. As in the discussion of the three-atom

14 More rigorous methods for deriving equation (35) are given by W. G. Penney, op. cit., p. 77, and by Van Vleck and Sherman, loc. cit.

problem, we obtain the relation



and similar relations for E^ and EM, in which the three expressions 
$(\alpha + \gamma)$, $(\beta + \delta)$, and $(\epsilon + \zeta)$ occur symmetrically.

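By analogy with equations (31) and (35) it follows that the energy for the lowest state of the four-atom system is given by the relation

$$E = \sum C - \big[\tfrac{1}{2}\{(\alpha + \gamma - \beta - \delta)^2 + (\beta + \delta - \epsilon - \zeta)^2 + (\epsilon + \zeta - \alpha - \gamma)^2\}\big]^{1/2}. \tag{36}$$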
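A short numerical sketch may help in reading the four-atom expression. It assumes the symmetric form in which the three sums (α + γ), (β + δ), and (ε + ζ) enter as squared differences under the radical; which exchange energy belongs to which atom pair is fixed by Fig. 70 and is not assumed here. The function names and numbers are hypothetical.

```python
import math


def four_atom_lowest(c_sum, alpha, beta, gamma, delta, epsilon, zeta):
    """Lowest energy of four univalent atoms in the symmetric (London-type) form."""
    p = alpha + gamma
    q = beta + delta
    r = epsilon + zeta
    radicand = 0.5 * ((p - q) ** 2 + (q - r) ** 2 + (r - p) ** 2)
    return c_sum - math.sqrt(radicand)


def three_atom_lowest(c_sum, alpha, beta, gamma):
    """Three-atom analogue, with the single exchange energies in place of the sums."""
    radicand = 0.5 * ((alpha - beta) ** 2 + (beta - gamma) ** 2 + (gamma - alpha) ** 2)
    return c_sum - math.sqrt(radicand)


# Setting one member of each sum to zero reproduces the three-atom form.
print(four_atom_lowest(-1.0, -1.0, -2.0, 0.0, 0.0, -3.0, 0.0))
print(three_atom_lowest(-1.0, -1.0, -2.0, -3.0))
```

When only the two exchange energies of one pairing are retained, the expression reduces to the sum of the Coulomb energy and those two exchange energies, i.e. two independent diatomic molecules.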

14.7 Activation Energy. As first pointed out by F. London,$^{15}$ equations (35) and (36) lead to a method for calculating the activation energy of chemical reactions from a knowledge of the potential energy (Morse) curves for the molecules involved in the reactions. This point of view was adopted by H. Eyring and M. Polanyi$^{16}$ in the consideration of some simple gas reactions and has since then been extended by the former and his collaborators to the interpretation of a large number of chemical reactions.$^{17}$

The activation energy ($E_A$) of a chemical reaction is determined from the temperature variation of the velocity constant k by the Arrhenius equation

$$\frac{d\ln k}{dT} = \frac{E_A}{RT^2}. \tag{37}$$

This equation may be written in the integrated form

$$k = Z e^{-E_A/RT}, \tag{38}$$

where Z is interpreted, on a kinetic theory basis, as the number of collisions per unit time per unit volume between reacting molecules. The exponential term corresponds, in accordance with the Boltzmann theorem, to the fraction of colliding molecules which have an energy in excess of $E_A$, and equation (38) is interpreted from this point of view

15 Sommerfeld Festschrift, S. Hirzel, Leipzig, 1928, p. 104. 

16 Naturwissenschaften, 18, 914 (1930); Z. physik. Chem., B12, 279 (1931). 

17 Some of the more important papers are the following: H. Eyring, Chem. Rev., 10, 103 (1932); J. Am. Chem. Soc., 53, 2537 (1931); 54, 3191 (1932); A. Sherman and H. Eyring, J. Am. Chem. Soc., 54, 2661 (1932); H. Eyring, J. Chem. Phys., 3, 107 (1935); H. Eyring, H. Gershinowitz, and C. E. Sun, J. Chem. Phys., 3, 786 (1935); H. Eyring, Chem. Rev., 17, 65 (1935); A. Wheeler, B. Topley, and H. Eyring, J. Chem. Phys., 4, 178 (1936). See also Van Vleck and Sherman, loc. cit. and W. G. Penney, "The Quantum Theory of Valence," Chapter V.

as indicating that only those molecules which have an energy in excess of the critical value $E_A$ actually react.
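The integrated Arrhenius form lends itself to a simple numerical illustration: given velocity constants at two temperatures, the activation energy follows from the slope of ln k against 1/T. The sketch below assumes Z is independent of temperature; the value of R in cal/(mole·deg) and the sample numbers are illustrative assumptions, not data from the text.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole K)


def rate_constant(Z, E_A, T, R=R_CAL):
    """Velocity constant k = Z * exp(-E_A / (R*T))."""
    return Z * math.exp(-E_A / (R * T))


def activation_energy(k1, T1, k2, T2, R=R_CAL):
    """E_A from two (k, T) pairs, assuming Z does not vary with temperature."""
    return R * math.log(k2 / k1) / (1.0 / T1 - 1.0 / T2)


k_500 = rate_constant(1.0e11, 40000.0, 500.0)
k_550 = rate_constant(1.0e11, 40000.0, 550.0)
print(activation_energy(k_500, 500.0, k_550, 550.0))
```

The round trip recovers the assumed activation energy of 40,000 cal./mole to within rounding error, which is the sense in which $E_A$ is "determined from the temperature variation of the velocity constant."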
Now let us consider the reaction

C + AB -> CA + B,

where A, B, and C are univalent atoms with s electrons. Let a, b, and c designate the corresponding single-electron functions, and let $\alpha$, $\beta$, and $\gamma$ designate the "exchange" energy terms for the molecules BC, CA, and AB, respectively. These symbols thus have the same significance as in equations (30) and (35).

In order that the reaction may occur, the atom C must be brought up to the molecule AB. We can do this in at least two ways, and in the first of these we let C approach AB along the surface for which $\alpha = \beta$. (For the case in which A and B are identical this will be the plane which is perpendicular to the axis of the molecule AB.) Then,

$$E^{c}_{ab} = C_{ab} + C_{ca} + C_{cb} + (2\alpha^2 + \gamma^2 - \alpha^2 - 2\alpha\gamma)^{1/2} = C_{ab} + C_{ca} + C_{cb} + \gamma - \alpha. \tag{39}$$

That is, the atoms A and B will repel the atom C with an energy $(\alpha + \beta)/2$, and at the same time the energy of binding of the system is decreased (E is made more positive).

Since both $\alpha$ and $\beta$ increase in magnitude with decrease in the distances $r_{AC}$ and $r_{BC}$, it is possible for $(\alpha + \beta)/2$ to become equal to $\gamma$, and then a reaction may occur with formation of either CA or CB, depending upon the relative magnitudes of $\alpha$ and $\beta$. In this case, therefore, the atom C has to acquire sufficient energy to pass over the potential energy "barrier" of magnitude $|\gamma|$, and this will be the approximate magnitude of the activation energy for the reaction. If the reaction results in the formation of the molecule CA, the binding energy for the latter is given by $C_{ca} + \beta$, and, neglecting the difference between the relatively small terms $C_{ab}$ and $C_{ca}$, the total change in energy for the reaction is given by $|\gamma| - |\beta|$.

Now it is evident that the reaction must actually proceed in such a manner that the activation energy is a minimum. As shown by F. London, this condition is obtained if the atom C is brought up to the molecule AB in a direction which is the extension of the molecular axis. Assuming that the reaction leads to formation of the molecule CA, we must permit C to approach along the axis of AB in such a way that C is closer to A than to B. In that case we may neglect the interaction energy $\alpha$ between B and C, and consequently,

$$E^{c}_{ab} = C_{ca} + C_{ba} + \sqrt{\gamma^2 + \beta^2 - \beta\gamma}.$$

The expression under the radical has its minimum value, $\tfrac{3}{4}\gamma^2$, for $\beta = \gamma/2$, so that the energy (the exchange terms being negative) passes through a maximum, and under these conditions

$$E^{c}_{ab} = C_{ca} + C_{ba} + \frac{\sqrt{3}}{2}\,\gamma. \tag{40}$$

That is, the atom C decreases the binding forces between the atoms A and B, and the energy of the system changes from the initial value $C_{ab} + \gamma$ (for C at infinity) through a maximum value $C_{ab} + C_{ac} + \gamma\sqrt{3}/2$ to a final value $C_{ac} + \beta$ (for B at infinity). The activation energy is evidently given to a first approximation by the magnitude $|\gamma| - (\sqrt{3}/2)|\gamma| = 0.134\,|\gamma|$, which is thus considerably smaller than the activation energy for the atom C brought up to AB in a direction at right angles to that of the molecular axis.
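The figure 0.134 |γ| can be recovered by scanning the exchange part of the energy for a linear approach. The sketch below is a rough illustration under stated assumptions: the Coulomb terms are treated as constant and omitted, the interaction α between B and C is neglected, γ is held fixed while β grows from zero, and the exchange energies are negative so that the radical is taken with the negative sign. The function names are hypothetical.

```python
import math


def linear_exchange_energy(beta, gamma):
    """Exchange part of the lowest energy for collinear C...A-B, alpha neglected."""
    return -math.sqrt(gamma ** 2 + beta ** 2 - beta * gamma)


def barrier_fraction(steps=10000):
    """Barrier height in units of |gamma| as beta runs from 0 to 2*gamma."""
    gamma = -1.0
    initial = linear_exchange_energy(0.0, gamma)  # C at infinity: energy = gamma
    top = max(
        linear_exchange_energy(gamma * i / steps, gamma)
        for i in range(2 * steps + 1)
    )
    return (top - initial) / abs(gamma)


print(round(barrier_fraction(), 3))
```

The maximum of the energy occurs at β = γ/2, where the exchange part equals (√3/2)γ, so the barrier is (1 − √3/2)|γ|, or about 0.134 |γ|.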

These considerations may be illustrated by means of Figs. 71$^{18}$ and 72. The energy of the system, as a function of the interatomic distances $r_{CA}$ and $r_{AB}$, is represented by a surface in a three-dimensional coordinate system. In two dimensions, this surface may be represented by contours which are lines of equal energy, and in Fig. 71 the point I represents the initial state of system in which $(r_{AB})_0$ is the interatomic distance when C is at infinity. As C approaches AB along the direction of the line AB, the energy of the system increases, and the curves marked 1, 2, 3, etc., indicate such a series of contours. It will be observed that for large values of $r_{CA}$ these contours are approximately parabolic in shape, since they correspond to Morse curves such as that shown in Fig. 62 for H$_2$. As $r_{AB}$ becomes greater, the energy becomes more and more positive and the contours become merged in a "plateau," as indicated in the figure.

FIG. 71. Energy surface (showing contour lines) for three atoms arranged linearly.

18 Figures similar to Fig. 71 for different reactions are shown in the paper by Eyring
and Polanyi (loc. cit.) and in subsequent papers by Eyring and his associates. (See
footnote 17.)

The final state of the system is represented by the point F, in which
$(r_{CA})_F$ is the interatomic distance in the molecule CA, with the
atom B removed to infinity. The transition from the state I to state F
occurs with minimum increase in energy along the path indicated by the
dashed line IPF, which is the bottom of a "valley" in the
three-dimensional representation. This valley has a maximum height
above I in the region P, which thus forms a "pass" or "saddle." If the
system has sufficient energy to carry it from I to P, then it may
either revert to state I or proceed to state F. The activation energy
thus corresponds to the absolute height of the point P with respect
to I.

FIG. 72. Relation between energies of activation and of reaction for
reversible reaction.

The change in energy of the system in passing from state I to state
F is indicated graphically in Fig. 72, where $E_{A1}$ is the activation
energy for the reaction

C + AB -> CA + B,

$E_{A2}$ is the activation energy for the reverse reaction, and

$$Q = E_{A1} - E_{A2} \tag{41}$$

is the heat of reaction.

In order to calculate the contour lines it is necessary to determine
the Coulomb and exchange terms as functions of $r_{CA}$ and $r_{AB}$.
In his calculations Eyring has assumed that these may be derived from
the potential energy curves, obtained for the diatomic molecules from
band spectra or other sources, on the basis that the exchange energy
term constitutes in every case by far the largest fraction of the
binding energy. For instance, in the case of H2, as mentioned in the
first section of this chapter, the Coulomb energy is less than 10 per
cent of the total energy of binding. That is, referring to
equation (1),

$$K = 0.9E,$$

where K is the exchange energy, and E is the binding energy.

Now, as mentioned in Chapter XIII, E can be represented for a
diatomic molecule, as a function of the internuclear distance, by a
Morse relation of the form

$$E(r) = -2De^{-a(r - r_0)} + De^{-2a(r - r_0)}, \tag{13.12}$$

where $a$ and $D$ are obtained (in general) from data on the band
spectrum. Hence, K [that is, the terms $\alpha$, $\beta$, $\gamma$ in
equation (35)] may be determined as a function of $r_{AB}$ or $r_{AC}$
in Fig. 71, and from such data it is then possible to deduce a value
for the minimum energy of activation for a given reaction.
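
The procedure just described may be sketched as follows. This is only an illustration under stated assumptions: the Morse constants used for H2 (D = 4.75 electron-volts, a = 1.94 per angstrom, r0 = 0.74 angstrom) are rough, assumed values inserted for the example, the symbols `a` and `r0` follow the usual form of the Morse relation, and the 0.9 fraction is the one quoted in the text for H2.

```python
import math

def morse_energy(r, D, a, r0):
    """Morse relation E(r) = -2 D exp(-a (r - r0)) + D exp(-2 a (r - r0))."""
    x = math.exp(-a * (r - r0))
    return -2.0 * D * x + D * x * x

def exchange_term(r, D, a, r0, exchange_fraction=0.9):
    """Exchange energy taken as a fixed fraction of the Morse binding energy."""
    return exchange_fraction * morse_energy(r, D, a, r0)

# Assumed, illustrative constants for H2 (energies in e.v., distances in angstroms).
D_H2, A_H2, R0_H2 = 4.75, 1.94, 0.74

for r in (0.5, 0.74, 1.0, 1.5, 2.5):
    print(r, round(morse_energy(r, D_H2, A_H2, R0_H2), 3),
          round(exchange_term(r, D_H2, A_H2, R0_H2), 3))
```

Tabulating `exchange_term` for each pair of atoms in this way supplies the exchange terms needed at every point of the contour diagram.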

A comparatively simple reaction which illustrates these considera- 
tions is that between atomic hydrogen and para- or orthohydrogen in 
accordance with the equation 



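$$\mathrm{H} + p\text{-}\mathrm{H}_2\;(\uparrow\downarrow) \;\rightleftharpoons\; o\text{-}\mathrm{H}_2\;(\uparrow\uparrow) + \mathrm{H},$$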



in which the directions of nuclear spin in the two molecules are indicated 
by the arrows. As mentioned in the previous chapter, the proportions 
of the two modifications in an equilibrium mixture vary with the 
temperature. Transition from one form to the other cannot occur spon- 
taneously, but only through the intermediary of a third substance, as 
for instance atomic hydrogen. 

From a study of the reaction velocity as a function of the temperature 
the activation energy for each of the opposing reactions may be deduced 
and compared with that derived from potential energy curves. Since 
the complete details of the experimental observations and calculations 
are given by A. Farkas, 19 it is only necessary in this connection to 
mention that the agreement obtained between the experimental and 
theoretical values of EA is quite satisfactory. 

FIG. 73. Potential energy (Morse) curves for H2, HBr, and Br2
(abscissa: r in angstroms).

The reactions, involving halogens, of the type indicated by the 
equations 

H + HBr -> H2 + Br
H + Br2 -> HBr + Br,



19 "Orthohydrogen, Parahydrogen and Heavy Hydrogen," Chapter IV.

have been investigated in a similar manner by Eyring and his associates. 20
Figure 73 21 shows the Morse potential energy curves for the molecules
involved in these reactions from which values of exchange terms may
be derived as functions of internuclear distance. Although the
agreement between the observed and calculated values of activation
energy is not as good as might appear desirable, there is no doubt
that this interpretation of activation energy is fundamentally
sound. 22

14.8 Resonance Energy. The method of " localized pairs " has also 
been applied by L. Pauling and other investigators to a type of problem 
in regard to molecular structures, for which chemists have been unable 
to obtain a definite solution by other methods. 

Let us consider, for instance, the case of four equivalent univalent
atoms with only spin degeneracy. There are available $2^4 = 16$
unperturbed functions, 23 but only six of these functions have a value
zero for the total spin. These will correspond to the bonds drawn thus:

Structure (A) a : b ; c : d
Structure (B) a : d ; b : c
Structure (C) a : c ; b : d.

These may also be represented thus:

[Bond diagrams for structures (A) and (B).]




As shown by G. Rumer, 24 only two of these structures, viz., (A) and 
(B), are independent, since (C) can be formed out of the other two. 
The relation between the three structures can be illustrated best by a 

20 H. Eyring, J. Am. Chem. Soc., 53, 2537 (1931).

21 Eyring and Polanyi, loc. cit.

22 Although the criticism of Eyring's particular assumptions by A. S. Coolidge and
H. M. James [J. Chem. Phys., 2, 811 (1934)] may be justifiable, it seems to the
writer that no arguments have been advanced which would contradict the
fundamental aspects of this interpretation of activation energy. The reader should also
consult, in this connection, the more recent publications by Eyring and his
associates on the "activated complex."

23 J. C. Slater, Phys. Rev., 38, 1109 (1931).

24 Göttinger Nachr., p. 377, 1932; see Pauling and Wilson, "Introduction to
Quantum Mechanics," p. 375.

vector diagram in the following manner: 25




By this procedure, as Pauling points out, "Any structure can be
resolved into structures involving no intersecting bonds." In
consequence of this relation Pauling designates (A) and (B) as a
canonical set and describes the system by the molecular orbital

$$\psi = a\psi_A + \sqrt{1 - a^2}\,\psi_B. \tag{42}$$

The interpretation of this relation is identical with that assigned to
analogous relations in Chapter XII. The system behaves as if
constituted of two types of structures, and the relative probabilities
of occurrence of the structures are given by the ratio
$a^2/(1 - a^2)$.

Let $\alpha$, $\beta$, $\gamma$, $\delta$ denote the exchange terms for
the molecules, as indicated above. If we neglect the exchange terms
between diagonal atoms and let $\delta = \beta = \gamma = \alpha$, then
equation (36) leads to a binding energy for the system

$$E = \sum C + 2\alpha. \tag{43}$$

Comparing this with the value $\sum C + \alpha$ which would be valid
for either of the structures (A) or (B) above, it is seen that the very
possibility of representing the system by a wave function of the form
given in equation (42) leads to an increase in the energy of binding.
This increase, which is given by $\alpha$ in this case, is known as the
resonance energy. Resonance occurs whenever a molecule can be
represented by
more than one equivalent structural formula, that is, when there is 
more than one way of joining up the bonds. The resonance energy 
is a measure of the increase in stability of the system which results from 
the fact that the molecule can apparently " resonate " between two or 
more equally reasonable structures. The analogy between this phe- 
nomenon and the increase in energy of two oscillators in virtue of their 
interaction is evident. 

For systems involving more than four equivalent atoms, special 
methods for the approximate determination of the resonance energy 
have been developed by L. Pauling 26 and also by H. Eyring and 
G. E. Kimball. 27 

25 L. Pauling, J. Chem. Phys., 1, 280 (1933).

26 Loc. cit., also "Introduction to Quantum Mechanics," pp. 374-382.

27 J. Chem. Phys., 1, 239 (1933).

One of the most interesting problems to which these methods have
been applied is that of the structure of benzene, a problem which was
first treated by E. Hückel. 28 Chemists have argued for a long period
regarding the proper representation of the molecule, and, as is well
known, different structures have been suggested which are shown in
Fig. 74a. Of these, the best known is that of Kekulé, but very good
reasons have also been advanced for the adoption of the other
structural formulas.







FIG. 74. (a) Suggested valence-bond structures for benzene ring
(Kekulé, Claus, Armstrong-Baeyer, Dewar, Ladenburg).
(b) Five canonical valence-bond structures for benzene, leading to
resonance phenomenon.

As shown by Pauling the whole argument is resolved by the assumption 
that in the normal state the molecule may be represented by a mixed 
eigenfunction of the form 

$$\psi = a(\psi_A + \psi_B) + b(\psi_C + \psi_D + \psi_E), \tag{44}$$

where the different component functions refer to the five canonical
structures shown in Fig. 74b. Equation (44) indicates, first, that the
molecule may be regarded as possessing the properties of each of the
five structures; second, that structures A and B are equally probable;
third, that C, D, and E are also equally probable; and last, that the
group A + B has a different probability from that of the group
C + D + E. Thus, we may consider that the fraction of the time during
which the molecule exists in state A is $a^2/(2a^2 + 3b^2)$, and the rest of the
time in the other states. In other words, the solution given by quantum 
mechanics to a problem which has vexed chemists for many decades is 
that everyone who suggested a solution was neither completely right 
nor completely wrong. 

28 Z. Physik, 70, 204 (1931).

According to L. Pauling and G. W. Wheland, 29 the values of the
coefficients in equation (44) are $a = 1$, $b = 0.4341$. Designating
the single exchange integral for two adjacent carbon atoms in the ring
by $\alpha$, the conclusions deduced for the value of the resonance
energy are shown in the following table.

                                         Total              Resonance
                                         Energy             Energy

Single Kekulé structure                  $Q + 1.5\alpha$      0.0
Resonance between A and B                $Q + 2.4\alpha$      $0.9\alpha$
Resonance among all five structures      $Q + 2.6055\alpha$   $1.1055\alpha$

The exchange energy term for a single Kekulé structure would be
$1.5\alpha$, where $\alpha$ is about 1.5 v.e. (= 34,580 cal.), and the
added resonance energy term is $1.1055\alpha$, that is, about 38,200
cal./mole, "of which about 80 per cent is due to the Kekulé structures
alone." 30
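
The arithmetic behind these figures is easily verified. The short sketch below uses only numbers quoted in the text (the coefficients of Pauling and Wheland, the table entries, and the value of 34,580 cal. for the exchange integral); the variable names are illustrative.

```python
# Coefficients of equation (44) as quoted from Pauling and Wheland.
a, b = 1.0, 0.4341

# Fraction of the time spent in one Kekule structure, a^2 / (2 a^2 + 3 b^2).
weight_A = a ** 2 / (2 * a ** 2 + 3 * b ** 2)
kekule_weight = 2 * weight_A          # both Kekule structures together

alpha_cal = 34580.0                   # exchange integral in cal. per mole (from the text)
resonance_all_cal = 1.1055 * alpha_cal      # resonance among all five structures
resonance_kekule_cal = 0.9 * alpha_cal      # resonance between A and B only
kekule_share = resonance_kekule_cal / resonance_all_cal

print(weight_A, kekule_weight)
print(resonance_all_cal, kekule_share)
```

The total comes to a little over 38,200 cal. per mole, and the Kekulé structures account for roughly four-fifths of it, in agreement with the statement quoted above.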

In a more recent publication, E. Hückel 31 has reached the conclusion
that in equation (44) the values of the coefficients are a = 0.450, 
b = 0.147. On the other hand, he deduces the same value for the 
resonance energy as Pauling and Wheland. 

The latter have also applied the same method to the calculation of 
resonance energies for other ring compounds, such as naphthalene and 
anthracene, and a large number of other organic molecules. Although 
the absolute values derived for the resonance energy may be doubtful, 
owing to the rather approximate methods of calculation, the resonance 
phenomenon is unquestionably of fundamental importance in leading 
to increased stability of molecules. 

29 J. Chem. Phys., 1, 362 (1933). 

30 From their very accurate determinations of the heats of organic reactions, G. B. 
Kistiakowsky and his co-workers, J. Am. Chem. Soc., 58, 146 (1936), deduce a value 
of about 36,000 cal. for the resonance energy of benzene. However, they find that 
in general the resonance calculations of Pauling are not in satisfactory agreement 
with the experimentally observed values. This, of course, merely indicates that it 
is at present impossible by the methods used in quantum mechanics to derive even 
approximately correct values of resonance energies. 

31 J. Phys. Radium, 6, 347 (1935). 



CHAPTER XV 
QUANTUM MECHANICS THEORY OF RADIATION 

15.1 Classical Theory of Radiation. The electromagnetic theory of
Clerk Maxwell led to the conclusion that radiant energy is propagated
as a wave motion. The simplest model of a source for the production
of such waves is an electric dipole with fixed axis and periodically
variable moment M, where

$$M = ex = ex_0\cos 2\pi\nu t. \tag{1}$$

In this equation $x_0$ represents the maximum amplitude of the
oscillating charge, and $\nu$, the frequency of oscillation.

Such a dipole is known as an electrical oscillator or Hertzian vibrator 
and finds its macroscopic representation in a short linear antenna such 
as is used for the radiation of very high-frequency radio waves. 

According to electromagnetic theory an electric charge moving with 
uniform velocity does not radiate energy. If, however, the charge is 
accelerated, the instantaneous rate at which energy is emitted is given 
by the relation 

$$-\frac{dE}{dt} = \frac{2e^2}{3c^3}\left(\frac{d^2x}{dt^2}\right)^2, \tag{2}$$

where x is the coordinate along which motion occurs (the axis of the 
dipole), and c is the velocity of light. 

From equations (1) and (2) it follows that for the linear harmonic
oscillator

$$-\frac{dE}{dt} = \frac{2(2\pi\nu)^4}{3c^3}\,e^2x_0^2\cos^2(2\pi\nu t).$$

The average rate of emission of energy per single oscillator is

$$-\overline{\frac{dE}{dt}} = \frac{(2\pi\nu)^4}{3c^3}\,e^2x_0^2,$$

since the average value of $\cos^2(2\pi\nu t)$ is $\tfrac{1}{2}$, and it
will be noted that the
frequency of the radiation emitted is identical with that of the oscillator.
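
The averaging step may be confirmed numerically. The following sketch, which is an illustration rather than part of the original argument, evaluates the instantaneous rate of equation (2) for the motion of equation (1) at equally spaced instants over one period and compares the mean with the closed expression; the units (e = x0 = nu = c = 1) are arbitrary and assumed only for the check.

```python
import math

def radiated_power(e, x0, nu, c, t):
    """Instantaneous rate of emission, equation (2), for x = x0 cos(2 pi nu t)."""
    acceleration = -(2 * math.pi * nu) ** 2 * x0 * math.cos(2 * math.pi * nu * t)
    return 2 * e ** 2 * acceleration ** 2 / (3 * c ** 3)

def average_power_numeric(e, x0, nu, c, samples=10000):
    """Mean of the instantaneous rate over one complete period."""
    period = 1.0 / nu
    return sum(radiated_power(e, x0, nu, c, period * k / samples)
               for k in range(samples)) / samples

def average_power_formula(e, x0, nu, c):
    """Closed expression (2 pi nu)^4 e^2 x0^2 / (3 c^3)."""
    return (2 * math.pi * nu) ** 4 * e ** 2 * x0 ** 2 / (3 * c ** 3)

numeric = average_power_numeric(1.0, 1.0, 1.0, 1.0)
formula = average_power_formula(1.0, 1.0, 1.0, 1.0)
print(numeric, formula)
```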

In general, the charge will not be concentrated at any one point, but
will be represented by a distribution function $\rho(x, y, z)$. For the
case of a negative charge distribution about a positive charge located
at the origin (as in atomic systems), the electric moment of the charge
distribution will be represented by the vector quantity (see
Appendix IV)

$$\mathbf{M} = e\int \mathbf{r}\,\rho(x, y, z)\,dx\,dy\,dz\,\cos(2\pi\nu t), \tag{3}$$

where $\mathbf{r}$ is the vector distance from the origin. That is,
$\mathbf{r}$ indicates both the magnitude of the distance of the
element of volume and the direction in which this distance is measured.
If we represent the components of the moment along the three axes of
coordinates by $M_x$, $M_y$, and $M_z$, the intensity of the radiation
for which the direction of polarization is along the x-axis is given by

$$J_x = \frac{(2\pi\nu)^4}{3c^3}\,M_x^2, \tag{4}$$

and similar relations will apply to the components for which the
directions of polarization are along the other two axes of coordinates.
Consequently, the average rate of emission of radiation for all three
components is given by

$$J_x + J_y + J_z = \frac{(2\pi\nu)^4}{3c^3}\,\{M_x^2 + M_y^2 + M_z^2\}. \tag{5}$$

In the case of a charge moving in some form of periodic orbit, as was
postulated in the Bohr theory, the motion along any coordinate q can be
represented by a Fourier series of the form

$$q = A_0 + A_1\cos 2\pi\nu t + \dots + A_n\cos 2\pi n\nu t + \dots + B_1\sin 2\pi\nu t + \dots + B_n\sin 2\pi n\nu t + \dots, \tag{6}$$

where, if q is real, $A_n$ and $B_n$ are real magnitudes.
We can also express q in the exponential form

$$q = \sum_n C_n e^{2\pi i n\nu t}, \tag{7}$$

where n has all integral values (including 0) from $-\infty$ to
$+\infty$. This is necessary, since, as shown in Chapter II,

$$C_n e^{2\pi i n\nu t} + C_{-n}e^{-2\pi i n\nu t} = C_n\cos 2\pi n\nu t + iC_n\sin 2\pi n\nu t + C_{-n}\cos 2\pi n\nu t - iC_{-n}\sin 2\pi n\nu t$$
$$= A_n\cos 2\pi n\nu t + B_n\sin 2\pi n\nu t,$$

where $2C_n = A_n - iB_n$ and $2C_{-n} = A_n + iB_n$.

Consequently, $\bar{C}_n = C_{-n}$, and

$$C_n\bar{C}_n = C_nC_{-n} = |C_n|^2 = \frac{A_n^2 + B_n^2}{4}.$$

From equation (6) it follows that, for this case, the intensity of the
radiation, that is, the average value J of the energy emitted per unit
time, is given by the relation

$$J = \frac{2e^2}{3c^3}\,\overline{\ddot{q}^2}, \tag{8}$$

where $\overline{\ddot{q}^2}$ = average value of $\ddot{q}^2$. From the
orthogonality relations for the trigonometric functions it follows that

$$\overline{\ddot{q}^2} = 2\sum_n (2\pi n\nu)^4\,|C_n|^2, \tag{9}$$

and consequently,

$$J = \frac{4e^2}{3c^3}\sum_n (2\pi n\nu)^4\,|C_n|^2, \tag{10}$$

where n assumes all integral values from +1 to $+\infty$.

Again the fundamental frequency $\nu$ and the harmonics $n\nu$ will
appear in the radiation, and the magnitude of the coefficient
$|C_n|^2$ will be a measure of the relative intensity of the
corresponding harmonic in the radiation emitted.
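
The relation between the time average of the squared acceleration and the sum over harmonics can be tested on a short Fourier series. The sketch below is illustrative only: the coefficient lists `A` and `B` (holding $A_1, A_2, \dots$ and $B_1, B_2, \dots$) and the fundamental frequency are arbitrary, assumed values, and the average is taken over equally spaced instants of one period.

```python
import math

def complex_coefficients(A, B):
    """C_n = (A_n - i B_n) / 2 for n = 1, 2, ..."""
    return [(a - 1j * b) / 2 for a, b in zip(A, B)]

def mean_square_acceleration(A, B, nu, samples=4096):
    """Time average of (d^2 q / dt^2)^2 over one period of the fundamental."""
    period = 1.0 / nu
    total = 0.0
    for k in range(samples):
        t = period * k / samples
        acceleration = 0.0
        for n, (a, b) in enumerate(zip(A, B), start=1):
            w = 2 * math.pi * n * nu
            acceleration -= w * w * (a * math.cos(w * t) + b * math.sin(w * t))
        total += acceleration ** 2
    return total / samples

def harmonic_sum(A, B, nu):
    """2 * sum over n >= 1 of (2 pi n nu)^4 |C_n|^2."""
    C = complex_coefficients(A, B)
    return 2 * sum((2 * math.pi * n * nu) ** 4 * abs(c) ** 2
                   for n, c in enumerate(C, start=1))

A = [1.0, 0.5, 0.0]     # assumed cosine amplitudes A_1, A_2, A_3
B = [0.0, 0.25, 0.1]    # assumed sine amplitudes B_1, B_2, B_3
lhs = mean_square_acceleration(A, B, nu=2.0)
rhs = harmonic_sum(A, B, nu=2.0)
print(lhs, rhs)
```

Each harmonic thus contributes independently, with a weight proportional to $n^4|C_n|^2$.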

The Bohr theory of the origin of spectral lines represented a radical 
departure from this point of view since it postulated that the spectral 
frequencies are obtained as a result of transitions between two levels 
in accordance with the relation 

$$\nu_{nm} = \frac{E_n - E_m}{h}. \tag{11}$$

Consequently, optical frequencies are not identical, on the basis of
Bohr's theory, with frequencies of revolution. However, when we are
dealing with orbits of large quantum number, the frequency of the
radiation emitted as a result of transitions between adjacent orbits
becomes approximately the same as that of the motion in either orbit. 1
That is, we have the approximate relation

$$\nu = \Delta n\,\omega,$$

where $\nu$ is the optical frequency, $\omega$ is the frequency of
revolution of the charge, and $\Delta n = 1, 2$, etc. Thus $\Delta n$
corresponds to the integer n in equations (9) and (10), and it is seen
that under these conditions (that is, in which $\nu$ is associated with
transitions between orbits of large quantum numbers) the deductions of
quantum theory are in agreement with those expected on the basis of
classical theory.

1 See discussion in Taylor's Treatise, pp. 1179-1181.
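
The approach of the optical frequency to the frequency of revolution may be illustrated for the hydrogen atom. The sketch below rests on two assumed relations of the Bohr theory that are not written out in this section: the transition frequency $R(1/n'^2 - 1/n^2)$ and the frequency of revolution $2R/n^3$ in the nth orbit, with R taken as about $3.29 \times 10^{15}$ per second.

```python
RYDBERG_FREQUENCY = 3.2898e15  # R in cycles per second (assumed value)

def transition_frequency(n, delta_n=1):
    """Bohr frequency for the transition n -> n - delta_n in hydrogen."""
    return RYDBERG_FREQUENCY * (1.0 / (n - delta_n) ** 2 - 1.0 / n ** 2)

def orbital_frequency(n):
    """Frequency of revolution in the nth Bohr orbit, 2R / n^3."""
    return 2.0 * RYDBERG_FREQUENCY / n ** 3

def frequency_ratio(n):
    """Optical frequency for n -> n - 1 divided by the frequency of revolution."""
    return transition_frequency(n) / orbital_frequency(n)

for n in (2, 10, 100, 1000):
    print(n, frequency_ratio(n))
```

For small n the two frequencies differ widely, but the ratio tends to unity as n increases, which is the content of the statement above.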

As emphasized by Bohr this correspondence or coincidence in the 
values of spectral frequencies as calculated from the two points of view 
cannot be accidental. The coincidence must also extend to ampli- 
tude (or intensity) of emitted radiation and direction of polarization, 
and Bohr showed that on this basis it is possible to deduce selection 
rules for the probabilities of transition between energy levels. This 
idea, which has been designated by Bohr as the Correspondence Prin- 
ciple, has also been carried over into quantum mechanics, and may be 
stated thus: For systems in which the product of momentum p and
associated coordinate q is very large compared to h, the deductions
of quantum mechanics must tend to become identical with those derived
from classical theory. It will be recognized that the condition
$pq \gg h$ is fulfilled by Bohr orbits of large quantum number.

15.2 The Schroedinger Equation Involving Time. In order to interpret
the method developed in quantum mechanics for calculating dipole
moments corresponding to transitions, it is necessary to consider once
more the derivation of the S. equation as given in Chapter II.

Starting with the differential equation for a wave propagation in the
form

$$\nabla^2\psi = \frac{1}{u^2}\frac{\partial^2\psi}{\partial t^2}, \tag{12}$$

where $\psi$, the "amplitude," is a function of the configuration space
coordinates and of t, a solution was postulated of the form

$$\psi = \phi(q_k)\,e^{-2\pi i\nu t}, \tag{13}$$

where $q_k$ denotes the generalized coordinates, and $\nu$ designates
the frequency of the de Broglie wave. This solution satisfies
equation (12), and if we multiply each term in the S. equation

$$\nabla^2\phi + \frac{8\pi^2\mu}{h^2}(E - V)\phi = 0$$

by $e^{-2\pi i\nu t}$, we obtain the differential equation

$$\nabla^2\psi + \frac{8\pi^2\mu}{h^2}(E - V)\psi = 0. \tag{14}$$

Let us now assume the relation

$$E = h\nu, \tag{15}$$

thus defining the frequency $\nu$, about which nothing definite was
stated previously. The justification for this procedure depends, of
course, upon the validity of the subsequent deductions, since we cannot
determine, by any direct experimental methods, the value of $\nu$ for
the de Broglie wave. Moreover, it is evident, since we always observe
differences in energy, that we might also assume

$$E = \text{constant} + h\nu.$$

Actually de Broglie postulated the relation

$$\nu = \frac{E + \mu_0c^2}{h},$$

where $\mu_0$ is the so-called "rest-mass" of the particle and
$E + \mu_0c^2$ is the absolute energy on the basis of the special
theory of relativity. However, it is evidently simpler to postulate
equation (15).
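It follows from equations (13) and (15) that

$$\frac{\partial\psi}{\partial t} = -\frac{2\pi i}{h}\,E\psi,$$

and, hence, equation (14) becomes

$$\nabla^2\psi - \frac{8\pi^2\mu}{h^2}V\psi = \frac{4\pi\mu}{ih}\frac{\partial\psi}{\partial t}. \tag{16}$$

For the conjugate complex function $\bar\psi$, we have the similar
relation

$$\nabla^2\bar\psi - \frac{8\pi^2\mu}{h^2}V\bar\psi = -\frac{4\pi\mu}{ih}\frac{\partial\bar\psi}{\partial t}. \tag{17}$$

Hence, multiplying the first of these relations by $\bar\psi$ and the
second by $\psi$, and subtracting,

$$\frac{\partial}{\partial t}(\psi\bar\psi) = \frac{ih}{4\pi\mu}\left(\bar\psi\nabla^2\psi - \psi\nabla^2\bar\psi\right). \tag{18}$$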

If now we multiply both sides by $d\tau$, the element of volume in the
configuration space, and integrate, we obtain the relation

$$\frac{\partial}{\partial t}\int\psi\bar\psi\,d\tau = \frac{ih}{4\pi\mu}\int\left(\bar\psi\nabla^2\psi - \psi\nabla^2\bar\psi\right)d\tau. \tag{19}$$

But in accordance with Green's theorem (see Appendix IV)

$$\int\left(\bar\psi\nabla^2\psi - \psi\nabla^2\bar\psi\right)d\tau = \int\left(\bar\psi\frac{\partial\psi}{\partial n} - \psi\frac{\partial\bar\psi}{\partial n}\right)dS,$$

where the integral on the right-hand side represents integration over a
"surface" bounding the "volume" in which the left-hand integration
is carried out, and $\partial\psi/\partial n$ is the gradient in the
direction of the normal to this surface. At the limits of the
configuration space, $\psi$ and $\bar\psi$ must vanish. (This is one of
the conditions which the S. eigenfunctions must satisfy.) Hence, the
right-hand side of equation (19) must vanish identically, and
consequently,

$$\frac{\partial}{\partial t}\int\psi\bar\psi\,d\tau = 0. \tag{20}$$

That is,

$$\int\psi\bar\psi\,d\tau = \text{constant}. \tag{21}$$

It is this result that permits us to interpret $\psi\bar\psi$ as a
density distribution function and to normalize $\psi$ so that

$$\int\psi\bar\psi\,d\tau = 1.$$

From equation (16) it follows that for V = 0, that is, for particles
moving in a zero field

$$\nabla^2\psi = \frac{4\pi\mu}{ih}\frac{\partial\psi}{\partial t}. \tag{22}$$

This equation has the same form as the diffusion equation [see
Appendix IV, equation (41)], with
$\dfrac{ih}{4\pi\mu} = -\dfrac{h}{4\pi i\mu} = D$, the "diffusion
constant."

Now let us consider the solution of equations (16) or (17). Because
of the linearity of the equation, the general solution has the form

$$\psi = \sum_n c_n\phi_n\,e^{-2\pi iE_nt/h}, \tag{23}$$

where $\phi_n$ and $E_n$ are the eigenfunction and eigenvalue
associated with the nth state, and $c_n$ is an arbitrary constant.
Similarly,

$$\bar\psi = \sum_n \bar{c}_n\bar\phi_n\,e^{2\pi iE_nt/h}. \tag{24}$$

That is, $\psi$ (or $\bar\psi$) is represented by a superposition of
wave functions, each of which corresponds to a definite energy state.
Hence,

$$\psi\bar\psi = \sum_n c_n\bar{c}_n\phi_n\bar\phi_n + \sum_{n,m} c_n\bar{c}_m\phi_n\bar\phi_m\,e^{-2\pi i(E_n - E_m)t/h}, \tag{25}$$

where the second summation extends over all values of n and m which
are not identical. If we multiply each term in this equation by
$d\tau$, and integrate over the configuration space, the integrals
involving the exponential terms vanish, since

$$\int\bar\phi_n\phi_m\,d\tau = 1 \text{ for } n = m, \qquad = 0 \text{ for } n \neq m.$$

It follows from equation (25) that

$$\int\psi\bar\psi\,d\tau = \sum_n |c_n|^2. \tag{26}$$

We can interpret the quantities $|c_n|^2$ as the relative probabilities
of finding the particle in the corresponding alternative states, and as
in equation (21) we can normalize the function $\psi$ by the condition

$$\sum_n |c_n|^2 = 1.$$

Let us now consider the exponential terms in equation (25). The
individual terms can be arranged in pairs of the form

$$c_n\bar{c}_m\phi_n\bar\phi_m\,e^{-2\pi i\nu_{nm}t} + c_m\bar{c}_n\phi_m\bar\phi_n\,e^{-2\pi i\nu_{mn}t}, \tag{27}$$

where

$$h\nu_{nm} = E_n - E_m = -h\nu_{mn}. \tag{28}$$

Since $c_n\bar{c}_m\phi_n\bar\phi_m$ and $c_m\bar{c}_n\phi_m\bar\phi_n$
are complex conjugates, we can write the expression in (27) in the form

$$\rho\,(\cos 2\pi\nu_{nm}t + i\sin 2\pi\nu_{nm}t), \tag{29}$$

where $\rho$ is a charge distribution function.

It is thus evident that $\psi\bar\psi$ is a periodic function in t,
with a series of "interference" frequencies given by equation (28).
These are, however, the very frequencies which occur as transition
frequencies in the Bohr theory. Hence, we conclude that
$\psi\bar\psi$ is a quantity which may function as a source of
electromagnetic vibrations, giving rise to the monochromatic radiations
postulated in the Bohr theory.

On the basis of this deduction it becomes possible to translate the
classical theory of radiation, as outlined in the previous section, in
terms of wave mechanics.

15.3 Quantum Mechanics Expressions for Dipole Moment and Intensity
of Radiation. Referring to the classical expression for $M_x$, that is,
the x-component of the dipole moment, given in equation (1), we shall
write the quantum mechanics analog for the emission of frequency
$\nu_{nm} = (E_n - E_m)/h$, in the form 2

$$M_x(nm) = e\int x\,\bar\phi_n\phi_m\,d\tau\;e^{2\pi i\nu_{nm}t} + e\int x\,\bar\phi_m\phi_n\,d\tau\;e^{-2\pi i\nu_{nm}t}. \tag{30}$$

It will be recognized that the two integrals on the right-hand side of
the last equation constitute a pair of complex conjugated matrix
elements which may be designated by the symbols $x_{nm}$ and $x_{mn}$
respectively. Hence,

$$\frac{d^2M_x(nm)}{dt^2} = -(2\pi\nu_{nm})^2\,e\left(x_{nm}e^{2\pi i\nu_{nm}t} + x_{mn}e^{-2\pi i\nu_{nm}t}\right). \tag{31}$$

In order to derive the corresponding expression for $(J_{mn})_x$, the
intensity of radiation emitted by the x-component of the dipole moment,
it is necessary to calculate the time-average value of the square of
the expression on the right-hand side of equation (31).

Evidently,

$$\overline{\left(\frac{d^2M_x(nm)}{dt^2}\right)^2} = 2(2\pi\nu_{nm})^4\,e^2\left|\int x\,\bar\phi_m\phi_n\,d\tau\right|^2. \tag{32}$$

Now the average value of $\int e^{\pm 4\pi i\nu t}\,dt$ over one or more
complete cycles is the same as that of $\int\cos 4\pi\nu t\,dt$ or
$\int\sin 4\pi\nu t\,dt$, and is evidently equal to zero. Furthermore,
as mentioned already, the matrix elements $x_{mn}$ and $x_{nm}$ are
complex conjugates irrespective of the exponential terms. Hence, the
relation for intensity of radiation assumes the form

$$(J_{mn})_x = \frac{4e^2}{3c^3}(2\pi\nu_{nm})^4\left|\int x\,\bar\phi_m\phi_n\,d\tau\right|^2. \tag{33}$$

2 The derivation that follows is to be regarded as a somewhat plausible argument
for the use of equation (30). For a more rigorous derivation the reader should consult
Pauling and Wilson, "Introduction to Quantum Mechanics," Chapter XI.

This equation is the wave mechanics analog of equation (4). From
this and the similar expressions for the y- and z-components of the
dipole moment, it is possible, as in equation (5), to calculate the
components of the radiation intensity with respect to different
directions of polarization.

While these relations enable us to calculate the magnitudes of the
radiation intensities in terms of the matrix elements built up out of the
corresponding eigenfunctions, they are specially useful in deducing
so-called selection rules.

If in any given case the matrix element $x_{mn}$ is found to vanish
identically we conclude that the corresponding component is not present
in the emitted radiation, and if this same result is derived for
$y_{mn}$ and $z_{mn}$ then we conclude that the corresponding
transition cannot occur.

The integrals $x_{nm}$ and $x_{mn}$ are evidently similar to integrals
encountered in the discussion of the perturbation theory, of the type

$$f_{mn} = \int\bar\phi_m f\phi_n\,d\tau,$$

where f is a function of the coordinates. Thus, in Chapter IX, f is
identical with V, the potential energy function. These integrals were
designated matrix elements, and it is evident, since the matrix elements
defined by equation (31) are complex conjugates, that the matrix is,
again, of the Hermitian type.

15.4 Selection Rule for Linear Harmonic Oscillator. A relatively
simple case in which the matrix elements associated with a transition
from one state to another may be calculated readily is that of the
linear harmonic oscillator which was discussed in Chapter V. It was
shown there that, omitting the time factor,

$$x_{nm} = \int_{-\infty}^{+\infty} x\,\phi_n\phi_m\,dx \tag{34}$$

is equal to zero, unless $m = n \pm 1$. For $m = n + 1$,

$$x_{n,n+1} = \sqrt{\frac{n + 1}{2\alpha}}, \tag{35}$$

and for $m = n - 1$,

$$x_{n,n-1} = \sqrt{\frac{n}{2\alpha}}. \tag{36}$$

That is, in the case of the linear harmonic oscillator only those
transitions can occur for which the quantum number changes by +1 or -1.
We have thus derived a selection rule for transitions between energy
states of a linear harmonic oscillator. Furthermore, the radiation
emitted in a transition between adjacent states must be linearly
polarized and the direction of polarization must coincide with that of
the motion of the oscillating particle.
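
The selection rule can be checked by direct numerical integration. The sketch below is an illustration under stated assumptions: it works in the dimensionless coordinate $\xi = \sqrt{\alpha}\,x$, with $\alpha = 4\pi^2\mu\nu/h$ (the usual definition, assumed here), so that the expected values are $\sqrt{(n+1)/2}$ and $\sqrt{n/2}$; the normalized oscillator functions are built by the standard three-term recurrence, and the integral is approximated by a simple sum on a uniform grid. Function names and grid settings are illustrative choices.

```python
import math

def oscillator_functions(xi, n_max):
    """Normalized oscillator eigenfunctions phi_0 .. phi_n_max at the point xi."""
    phi = [math.pi ** -0.25 * math.exp(-0.5 * xi * xi)]
    if n_max >= 1:
        phi.append(math.sqrt(2.0) * xi * phi[0])
    for n in range(1, n_max):
        # phi_{n+1} = sqrt(2/(n+1)) xi phi_n - sqrt(n/(n+1)) phi_{n-1}
        phi.append(math.sqrt(2.0 / (n + 1)) * xi * phi[n]
                   - math.sqrt(n / (n + 1)) * phi[n - 1])
    return phi

def matrix_element(n, m, half_width=12.0, points=4001):
    """Integral of xi * phi_n * phi_m over the line, by summation on a uniform grid."""
    step = 2.0 * half_width / (points - 1)
    total = 0.0
    for k in range(points):
        xi = -half_width + k * step
        phi = oscillator_functions(xi, max(n, m))
        total += xi * phi[n] * phi[m]
    return total * step

print(matrix_element(2, 3), math.sqrt(3 / 2))   # adjacent states
print(matrix_element(2, 4), matrix_element(3, 3))   # forbidden combinations
```

Only the elements connecting adjacent states differ appreciably from zero; dividing by $\sqrt{\alpha}$ restores the ordinary coordinate x.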

According to equation (33), the intensity of the radiation associated
with the transition $n \to n - 1$ is given by the relation

$$J_{n,n-1} = \frac{4e^2}{3c^3}(2\pi\nu)^4\,x_{n,n-1}^2,$$

where, according to equations (36) and (5.7),

$$x_{n,n-1}^2 = \frac{n}{2\alpha}, \qquad \alpha = \frac{4\pi^2\mu\nu}{h}. \tag{37}$$

Hence:

$$J_{n,n-1} = \frac{(2\pi\nu)^4\,2ne^2}{3c^3\alpha}. \tag{38}$$

Similarly, it follows that

$$J_{n+1,n} = \frac{(2\pi\nu)^4\,2(n + 1)e^2}{3c^3\alpha}. \tag{39}$$

It is of interest to compare these results with the conclusions from
classical theory on the basis $E_n = (n + \tfrac{1}{2})h\nu$. In that
case, the value of the maximum amplitude for the state n is
$\sqrt{2n + 1}\,x_0$, where $x_0$ is the maximum amplitude for the
state n = 0, while that for the state n - 1 is $\sqrt{2n - 1}\,x_0$.
Therefore, the classical value for the intensity of a transition from
the level n to the level n - 1 would be between

$$\frac{(2\pi\nu)^4e^2}{3c^3}\,(2n + 1)\,x_0^2$$

and

$$\frac{(2\pi\nu)^4e^2}{3c^3}\,(2n - 1)\,x_0^2.$$

Since $x_0^2 = h/4\pi^2\mu\nu$, it is evident that for large values of
n this agrees with the conclusions stated in equations (38) and (39).

15.5 Selection Rules for Quantum Numbers m and l. In Chapter VI
it was shown that the eigenfunction for the rigid rotator with fixed
axis is given by

$$Z_m = \frac{1}{\sqrt{2\pi}}\,e^{im\eta}, \tag{40}$$

where |m| = 0, 1, 2, etc.
The x-component of the center of gravity of the charge consists of
terms of the type

$$x_{nm} = \int x\,Z_m\bar{Z}_n\,d\tau\;e^{2\pi i(E_n - E_m)t/h}. \tag{41}$$

If we assume that the rotator is represented by a charge e moving on
a circle of radius r, we obtain the relations

$$x = r\cos\eta = \frac{r}{2}\left(e^{i\eta} + e^{-i\eta}\right), \qquad y = r\sin\eta = -\frac{ir}{2}\left(e^{i\eta} - e^{-i\eta}\right). \tag{42}$$

Omitting the time factor, it follows from equations (40), (41), and
(42) that

$$x_{nm} = \frac{r}{4\pi}\left[\int e^{i(m - n + 1)\eta}\,d\eta + \int e^{i(m - n - 1)\eta}\,d\eta\right], \tag{43}$$

where the integration is over the range $0 \le \eta \le 2\pi$.
Similarly,

$$y_{nm} = -\frac{ir}{4\pi}\left[\int e^{i(m - n + 1)\eta}\,d\eta - \int e^{i(m - n - 1)\eta}\,d\eta\right]. \tag{44}$$

In equation (43) the first integral vanishes unless

$$m - n + 1 = 0,$$

and the second integral vanishes unless

$$m - n - 1 = 0.$$

In either case the integral is equal to $2\pi$, and

$$x_{nm} = \frac{r}{2} \text{ for } n = m \pm 1, \tag{45}$$

while

$$y_{nm} = -\frac{ir}{2} \text{ for } n = m + 1, \qquad = +\frac{ir}{2} \text{ for } n = m - 1. \tag{46}$$

We have thus derived a selection rule for the rotator, that only those
transitions can occur for which $\Delta m = \pm 1$. In other words, in
the case of the rigid rotator with fixed axis, the only transitions
permitted are those between adjacent states.
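
The vanishing of these integrals except for adjacent states is readily confirmed by numerical averaging over the angle. The sketch below is illustrative: it evaluates $(1/2\pi)\int x\,e^{i(m - n)\eta}\,d\eta$ and the corresponding integral for y by summation over equally spaced angles, with the radius taken as unity (an arbitrary choice) and with the ordering of the indices assumed as in the exponents discussed above.

```python
import cmath
import math

def rotator_x_element(m, n, radius=1.0, samples=720):
    """(1 / 2 pi) * integral of r cos(eta) exp(i (m - n) eta) over 0 .. 2 pi."""
    total = 0.0 + 0.0j
    for k in range(samples):
        eta = 2.0 * math.pi * k / samples
        total += radius * math.cos(eta) * cmath.exp(1j * (m - n) * eta)
    return total / samples

def rotator_y_element(m, n, radius=1.0, samples=720):
    """(1 / 2 pi) * integral of r sin(eta) exp(i (m - n) eta) over 0 .. 2 pi."""
    total = 0.0 + 0.0j
    for k in range(samples):
        eta = 2.0 * math.pi * k / samples
        total += radius * math.sin(eta) * cmath.exp(1j * (m - n) * eta)
    return total / samples

for n in (0, 1, 2, 3, 4):
    print(n, rotator_x_element(2, n), rotator_y_element(2, n))
```

With m = 2, only n = 1 and n = 3 give elements of magnitude r/2; the x-elements are real and the y-elements purely imaginary with opposite signs for the two cases.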

To determine the direction of polarization of the emitted radiation,
we note that

$$M_x(m - 1, m) = e\left(x_{m-1,m}\,e^{2\pi i\nu t} + x_{m,m-1}\,e^{-2\pi i\nu t}\right) = er\cos 2\pi\nu t.$$

Similarly, the relations are deduced

$$M_x(m + 1, m) = er\cos 2\pi\nu t,$$
$$M_y(m - 1, m) = er\sin 2\pi\nu t,$$

and

$$M_y(m + 1, m) = er\sin 2\pi\nu t.$$

These relations show that the radiation emitted and absorbed is
circularly polarized, since the x- and y-components of the electric
moment are equal for each transition. For the spontaneous transition
$m + 1 \to m$, the light emitted is circularly polarized in the
direction of the rotation, and for the transition $m - 1 \to m$, the
light absorbed is also that which is polarized in the direction of
rotation.

It also follows from equation (45) that

$$J_x(m \pm 1, m) = \frac{(2\pi\nu)^4e^2r^2}{3c^3}. \tag{47}$$

We shall now consider the rigid rotator with free axis. The eigen- 
functions for this system, as derived in Chapter VI, are given by 



The dipole moment corresponding to transition from state I, m to 
state I', rnf is given by 



M(?, w, l f , m f ) = J rYi 



in which the term involving t has been omitted, and r is a vector which is 
defined in terms of x, y, and z. 
Hence, the x-component of the dipole moment is given by 

M * - ^2 ( * sin e ' p r(cos 0) P$ (cos 0) sin Od6 

N Jo 

f 2 *cosije <m T <fB ' 1 'du, (49) 

Jo 
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where 1/N designates the coefficient in equation (48). Similarly, 

M y = -^ /"sin Pf (cos 0) PJ?' (cos 0) sin 0d0 

</V t/o 

/2ir 

I atn M . ^^^"~"*"*''7/7(i* ^(1^1 

S1H W e C"ff W^/ 

t/O 

and 

67* /* T 

M, = -5 I cos Pf (cos 0) PJJ' (cos 0) sin 0d0 

^V /o 

/27T 

Jo 

Let us first consider M^. Since the integral involving ry vanishes for 
m & m', we need to consider, in calculating M 2 , only those transitions 
for which Am = 0, and consequently the integral involving TJ is equal to 
2ir. Hence, 

M * - -^ ' 2* I cos e ' * JT(cos 0)PP' (cos 0)d0. (51) 

IV t/O 

Now in any treatise on Legendre functions the following recursion 
formula is deduced: 

$$(2l+1)\cos\theta\, P_l^m(\cos\theta) = (l+m)\,P_{l-1}^m(\cos\theta) + (l-m+1)\,P_{l+1}^m(\cos\theta).$$

Substituting from this relation in equation (51) it follows from the 
orthogonal properties of the function $P_l^m(\cos\theta)$ that $M_z$ vanishes unless 
l′ = l ± 1. 

By evaluating $M_x$ and $M_y$, it is found that light polarized along these 
axes is emitted (or absorbed) only when m′ = m ± 1. This conclusion 
has already been deduced in the consideration of the rigid rotator 
with fixed axis. 

We have thus deduced the following selection rules: Only those 
transitions can occur for which 

Δm = 0 or ±1, and Δl = ±1. 

In the case of the total quantum number n, it is found that there is no 
similar selection principle. Transitions are permitted between any pair 
of values of this number. 
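
The rule Δl = ±1 for light polarized along z may be checked numerically in the simplest case, m = m′ = 0, where the associated functions reduce to the Legendre polynomials. The following is only a sketch under that assumption: the function names, the use of Simpson's rule, and the number of steps are illustrative choices and are not taken from the text.

```python
def legendre(l, x):
    """P_l(x) from the recurrence (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}."""
    p_prev, p = 1.0, x
    if l == 0:
        return p_prev
    for k in range(1, l):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


def z_integral(l, l2, n=2000):
    """Simpson estimate of the integral of x P_l(x) P_l2(x) over -1..1.

    With x = cos(theta) this is the theta-integral of equation (51)
    for m = 0, apart from the constant factor in front.
    """
    h = 2.0 / n
    total = 0.0
    for i in range(n + 1):
        x = -1.0 + i * h
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * x * legendre(l, x) * legendre(l2, x)
    return total * h / 3.0


if __name__ == "__main__":
    for l2 in range(5):
        print("l = 1, l' =", l2, " integral =", round(z_integral(1, l2), 10))
```

Only the pairs with l′ = l ± 1 give an integral that differs appreciably from zero.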
15.6 Wave Packets. Group Velocity. In this section we shall 
consider some further consequences of the S. equation (18) involving 
time. It will be recognized that this differential equation must in some 
manner give us the same kind of information about the function ψ 
as is obtained in classical mechanics from the equations of motion. As 
shown in Chapter IV, the latter may be expressed in terms of the Hamiltonian 
function $H = H(p_i, q_i)$ by the canonical equations 

$$\frac{dq_i}{dt} \equiv \dot{q}_i = \frac{\partial H}{\partial p_i}$$

and

$$\frac{dp_i}{dt} \equiv \dot{p}_i = -\frac{\partial H}{\partial q_i}. \qquad (53)$$

Given the initial values of $p_i$, $q_i$ and the form of the function $H(p_i, q_i)$ 
it is possible, by solving these canonical equations, to predict the 
magnitudes of the variables $p_i$ and $q_i$ for any subsequent value of t. In 
view of the validity of the Principle of Indeterminacy it is evidently 
impossible, as stated previously, to carry out such precise calculations 
in connection with systems for which the product $p_i q_i$ is of the same 
order of magnitude as h. However, we do know that, as the magnitude 
of the product $p_i q_i$ increases to values which are considerably greater 
than h, the methods of classical mechanics become more and more 
applicable. This is, in fact, the essential significance of the 
Correspondence Principle. Therefore, in the formulation of quantum 
mechanics we must be guided by the criterion that in the limit of large 
quantum numbers, that is, for large values of $\oint p_i\,dq_i$, the results 
obtained by the new theory must tend to become identical with those 
derived by classical mechanics. 

The essential difference between the classical and quantum mechanics 
is, of course, due to the fact that the purely corpuscular aspect of 
motion is no longer adequate for the description of atomic systems. 
The observations on reflection and refraction of electrons make it neces- 
sary to regard the behavior of these particles as somehow associated 
with " waves," the de Broglie waves. Furthermore, as Heisenberg 
demonstrated, this involves a radical revision of our notions regarding 
the significance of coordinates and momentum variables in the case of 
atomic systems, and he formulated this new point of view in his Principle 
of Indeterminacy. 

On the basis of this principle a homogeneous beam of electrons travelling 
in a given direction, that of the x-coordinate, possesses a definite 
momentum p, and we represent this state of affairs by the eigenfunction 

$$\psi = A\,e^{2\pi i(\sigma x - \nu t)},$$

where 

$$\sigma = \frac{1}{\lambda} = \frac{p}{h} \quad\text{and}\quad \nu = \frac{E}{h}.$$

The real part of the complex function ψ corresponds to a sine or cosine 
curve extending from −∞ to +∞. When we inquire regarding the 
instantaneous position of the electron, the answer given by quantum 
mechanics is that, while we are completely ignorant of this coordinate, 
it is possible to predict the probability of occurrence of an electron per 
unit length of its path. 

Similarly, we may define the instantaneous position x of a particle. 
But in that case, the velocity may be any value from zero to infinity. 
How then can we represent a "localized" particle? In the language of 
quantum mechanics this means that the form of the function ψ is to 
be such that it has physically significant values over only an extremely 
narrow range of values of x. The best illustration of such a function is 
the probability curve represented by the relation 

$$y = b\,e^{-c^2x^2}. \qquad (54)$$

The plot of this function has the form shown in Fig. 24 for the 
eigenfunction of the linear harmonic oscillator in the state n = 0.³ This 
curve is symmetrical about the origin, and for x = ±1/c, the value of y is 
1/e, that is, 0.3678 of its value for x = 0. Evidently, 1/c is a measure 
of the "spread" of the curve. The greater the value of c, the smaller 
the value of |x| for which the value of y decreases to any given fraction of 
its value for x = 0. 

On the basis of Fourier's theory, any function which is continuous and 
single valued over a range of values of x may be represented by a series 
involving cosine or sine functions.⁴ Each of these functions will 
represent a stationary wave of definite wave length λ, which corresponds, 
therefore, to a definite momentum p. Now if we wish to represent 
the probability function ψ by such a series it is found that the 
number of terms required to represent the function ψ with any degree 
of approximation increases with increase in the coefficient c in 
equation (54). In terms of quantum mechanics this means that, the more 
we attempt to minimize Δx in the representation of ψ, the greater the 
range of values of p which are required in forming the "wave packet" 
for ψ(x). 

3 See Chapter V, supplementary note 1, also J. W. Mellor's "Higher 
Mathematics." 

4 See Chapter VI, supplementary note 1. 
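
The statement that a narrower curve requires more terms may be illustrated by a rough count. The sketch below expands the function of equation (54), with b = 1, in a cosine series on an interval −L to +L and counts the terms needed to reproduce its value at x = 0. The interval L, the tolerance, and the function names are assumptions made for the illustration, and the coefficients are the closed forms valid when the curve is negligibly small at x = ±L.

```python
import math


def cosine_coeff(n, c, L):
    """Cosine-series coefficient of exp(-c^2 x^2) on -L..L.

    Closed form obtained by extending the integral to infinity,
    which assumes c*L is large enough for the tails to be negligible.
    """
    a = math.sqrt(math.pi) / (c * L) * math.exp(-((n * math.pi / L) ** 2) / (4 * c * c))
    return a / 2 if n == 0 else a


def terms_needed(c, L=10.0, tol=1e-3):
    """Number of terms (n = 0, 1, 2, ...) before the partial sum at x = 0
    comes within tol of the true value y(0) = 1."""
    partial = 0.0
    n = 0
    while True:
        partial += cosine_coeff(n, c, L)
        if abs(partial - 1.0) < tol:
            return n + 1
        n += 1


if __name__ == "__main__":
    for c in (1.0, 2.0, 4.0):
        print("c =", c, " terms needed:", terms_needed(c))
```

Since the nth term corresponds to the wave number n/2L, a count that rises in proportion to c means a proportionately wider range of momenta.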

Thus, the sense in which the term "wave packet" is used in quantum 
mechanics is essentially the following. In order to obtain a function 
ψ which shall localize the particle in a relatively narrow region of the 
configuration space, we represent ψ by a summation of trigonometric 
terms each of which corresponds to the amplitude (as a function of 
coordinates and time) of a de Broglie wave of definite wave length and 
frequency. That is, we can always represent a given ψ by the series 



in which the values of the coefficients are obtained by means of the 
relation 



a n - J *r**dx, 

and the actual number of terms will depend upon how accurately it is 
desired to approximate the function ψ in a region $x_2 > x > x_1$. 

We shall now deduce the relation between the velocity of the particle, 
regarded as a corpuscle, and the velocity of propagation of the de Broglie 
waves which constitute the wave packet. For this purpose let us 
consider the simple wave train 

$$y = y_0\cos 2\pi(\nu t - \sigma x), \qquad (55)$$

in which ν = frequency, and σ = 1/λ = wave number. If we superpose 
on this wave another wave of the same amplitude ($y_0$) but slightly 
different frequency ν + Δν and slightly different wave number σ + Δσ, 
the amplitude of the resulting motion is given by the relation 

$$y = y_0\cos 2\pi(\nu t - \sigma x) + y_0\cos 2\pi[(\nu + \Delta\nu)t - (\sigma + \Delta\sigma)x]$$
$$\;\; = 2y_0\cos 2\pi(\nu t - \sigma x)\,\cos\pi(\Delta\nu\, t - \Delta\sigma\, x), \qquad (56)$$

where, in the first cosine factor, Δν/2 and Δσ/2 have been neglected in 
comparison with ν and σ. 



The equation indicates a train of waves of frequency Δν and wave 
number Δσ which is modulated by the other wave of much higher 
frequency and higher wave number. The maximum amplitude travels 
with a velocity v defined by the relation 

$$v = \frac{x}{t} = \frac{\Delta\nu}{\Delta\sigma} = \frac{d\nu}{d\sigma}\ \text{in the limit.} \qquad (57)$$

On the other hand, the individual waves travel with velocity 

$$u = \frac{\nu}{\sigma}.$$

The latter is designated the phase velocity, and v, the group velocity. 
Equation (57) is well known in the theory of wave motion and is often 
written in the form 

$$\frac{1}{v} = \frac{d(\nu/u)}{d\nu} = \frac{1}{u} - \frac{\nu}{u^2}\frac{du}{d\nu},$$

which gives the relation between the two magnitudes u and v. 
Evidently, v = u only if du/dν = 0, that is, if the phase velocity is 
independent of the frequency. This is true of electromagnetic waves in a 
vacuum. Therefore, under these conditions, the phase and group 
velocities are equal. On the other hand, the group velocity of gravity 
waves propagated in deep water is one-half the phase velocity. That is, 
the individual waves travel faster than the group. 
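
Both statements may be verified numerically. In the sketch below, `beat` compares the direct sum of the two wave trains with the product form, written with the mean frequency ν + Δν/2 and mean wave number σ + Δσ/2 so that the identity is exact; `group_velocity` estimates dν/dσ of equation (57) by a central difference. The deep-water relation ν² = gσ/2π, the value of g, and all names are assumptions introduced for the illustration.

```python
import math

G = 9.81  # acceleration of gravity in m/sec^2 (assumed value)


def beat(y0, nu, sigma, dnu, dsigma, x, t):
    """Return (direct sum of the two waves, product form with mean nu and sigma)."""
    direct = (y0 * math.cos(2 * math.pi * (nu * t - sigma * x))
              + y0 * math.cos(2 * math.pi * ((nu + dnu) * t - (sigma + dsigma) * x)))
    product = (2 * y0
               * math.cos(2 * math.pi * ((nu + dnu / 2) * t - (sigma + dsigma / 2) * x))
               * math.cos(math.pi * (dnu * t - dsigma * x)))
    return direct, product


def nu_deep_water(sigma):
    """Frequency of a deep-water gravity wave of wave number sigma = 1/lambda."""
    return math.sqrt(G * sigma / (2 * math.pi))


def group_velocity(nu_of_sigma, sigma, d=1e-6):
    """Central-difference estimate of d(nu)/d(sigma), equation (57)."""
    return (nu_of_sigma(sigma + d) - nu_of_sigma(sigma - d)) / (2 * d)


if __name__ == "__main__":
    sigma = 0.1                               # waves per metre
    u = nu_deep_water(sigma) / sigma          # phase velocity
    v = group_velocity(nu_deep_water, sigma)  # group velocity
    print("phase velocity:", u, " group velocity:", v, " ratio:", v / u)
```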

In a wave motion, energy is transferred only at the speed defined by the 
group velocity. Consequently, in the case of de Broglie waves, the group 
velocity is identical with that of the particle. The phase velocity is a 
fictitious magnitude, since it is impossible to observe for de Broglie 
waves a frequency ν. 

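This conclusion may also be stated thus: For the corpuscle regarded 
as a wave motion 

$$\nu = \frac{E}{h}, \qquad \lambda = \frac{h}{p}, \qquad \sigma = \frac{1}{\lambda} = \frac{p}{h}. \qquad (58)$$

But in accordance with equation (57), the group velocity is given by 
the relation 

$$v = \frac{d\nu}{d\sigma} = \frac{dE}{dp}, \qquad (59)$$

which corresponds to the canonical equation 

$$\dot{q} = \frac{\partial H}{\partial p}. \qquad (60)$$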

If now we represent E as a function of p for any set of particles in 
motion, then the corpuscular velocities are determined for each value of 
E by the value of the slope dE/dp. 
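
As a small check on equation (59), the slope dE/dp may be computed for a free particle, for which E = p²/2m, and compared with the corpuscular velocity p/m. The sketch assumes the electron mass of Appendix II and an arbitrary velocity of 10⁸ cm sec⁻¹; the function names are illustrative only.

```python
M0 = 9.035e-28  # mass of electron in grams (value quoted in Appendix II)


def energy_free(p, m):
    """Kinetic energy of a free particle of momentum p and mass m."""
    return p * p / (2.0 * m)


def slope_dE_dp(energy, p, m, dp=None):
    """Central-difference estimate of dE/dp, the group velocity of (59)."""
    dp = dp or 1e-6 * abs(p)
    return (energy(p + dp, m) - energy(p - dp, m)) / (2.0 * dp)


if __name__ == "__main__":
    velocity = 1.0e8          # cm/sec, assumed for the example
    p = M0 * velocity
    print("p/m   =", p / M0)
    print("dE/dp =", slope_dE_dp(energy_free, p, M0))
```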
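15.7 Relation between Eigenfunction Representation and Particle 
Velocity. We shall now consider some further deductions from the S. 
equation involving time. For this purpose it is convenient to write 
equation (16) in the form 

$$\frac{\partial^2\psi}{\partial x^2} - aV\psi = -\frac{4\pi i\mu}{h}\frac{\partial\psi}{\partial t}, \qquad (61)$$

and equation (17) in the form 

$$\frac{\partial^2\bar\psi}{\partial x^2} - aV\bar\psi = \frac{4\pi i\mu}{h}\frac{\partial\bar\psi}{\partial t}, \qquad (62)$$

where ψ = ψ(x, t) and a = 8π²μ/h². 
Hence, we derive, as before, equation (18), that is, the relation 

$$\frac{\partial(\psi\bar\psi)}{\partial t} = \frac{h}{4\pi i\mu}\left(\psi\frac{\partial^2\bar\psi}{\partial x^2} - \bar\psi\frac{\partial^2\psi}{\partial x^2}\right)$$

or 

$$\frac{\partial\rho}{\partial t} = \frac{h}{4\pi i\mu}\frac{\partial}{\partial x}\left(\psi\frac{\partial\bar\psi}{\partial x} - \bar\psi\frac{\partial\psi}{\partial x}\right), \qquad (63)$$

where $\rho = \psi\bar\psi$ is the "density." 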

In order to realize the significance of this equation let us consider the 
motion of particles along the x-coordinate. Let ρ designate the density 
per unit length, and j, the "current," that is, the rate at which particles 
pass a given point. If ∂ρ/∂t = 0, there is obviously no variation in j 
with distance. However, if the current flowing into the section Δx is 
different from that flowing out of it, there must be a change in density, 
in accordance with the relation 

$$\frac{\partial\rho}{\partial t} = -\frac{\partial j}{\partial x}. \qquad (64)$$

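Comparing (64) with (63) it follows that the current density at x is 
defined in quantum mechanics by the relation 

$$j = \frac{h}{4\pi i\mu}\left(\bar\psi\frac{\partial\psi}{\partial x} - \psi\frac{\partial\bar\psi}{\partial x}\right). \qquad (65)$$

Hence, the average velocity in the x-direction is given by 

$$\bar{v}_x = \int j\,dx = \frac{h}{4\pi i\mu}\int\left(\bar\psi\frac{\partial\psi}{\partial x} - \psi\frac{\partial\bar\psi}{\partial x}\right)dx. \qquad (66)$$

More generally, if ψ is a function of the configuration space, 

$$\mathbf{j} = \frac{h}{4\pi i\mu}\left(\bar\psi\,\mathrm{grad}\,\psi - \psi\,\mathrm{grad}\,\bar\psi\right), \qquad (67)$$

where the "gradient" is the rate of change of ψ along the normal to the 
"surface" at any point. (This result may be derived directly from the 
divergence theorem which is discussed in Appendix IV.) 
Equation (66) may also be derived from the definition of the velocity 

$$\bar{v}_x = \frac{d\bar{x}}{dt}, \qquad (68)$$

where $\bar{x}$ designates the average value of x. 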
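Hence, 

$$\frac{d\bar{x}}{dt} = \frac{d}{dt}\int x\,\psi\bar\psi\,dx = \int x\,\frac{\partial(\psi\bar\psi)}{\partial t}\,dx.$$

(In these and subsequent equations, it will be understood that the 
limits of integration are −∞ and +∞.) Substituting from equations (61) and 
(62) the last relation becomes 

$$\frac{d\bar{x}}{dt} = \frac{ih}{4\pi\mu}\int x\left(\bar\psi\frac{\partial^2\psi}{\partial x^2} - \psi\frac{\partial^2\bar\psi}{\partial x^2}\right)dx.$$

Now, 

$$\bar\psi\frac{\partial^2\psi}{\partial x^2} - \psi\frac{\partial^2\bar\psi}{\partial x^2} = \frac{\partial}{\partial x}\left(\bar\psi\frac{\partial\psi}{\partial x} - \psi\frac{\partial\bar\psi}{\partial x}\right).$$

Hence, 

$$\int x\,\frac{\partial}{\partial x}\left(\bar\psi\frac{\partial\psi}{\partial x} - \psi\frac{\partial\bar\psi}{\partial x}\right)dx = \left[x\left(\bar\psi\frac{\partial\psi}{\partial x} - \psi\frac{\partial\bar\psi}{\partial x}\right)\right] - \int\left(\bar\psi\frac{\partial\psi}{\partial x} - \psi\frac{\partial\bar\psi}{\partial x}\right)dx.$$

But $x\bar\psi\,(\partial\psi/\partial x)$ vanishes at the limits. Hence, we derive the 
relation⁵ for the particle velocity in the form 

$$\frac{d\bar{x}}{dt} = \frac{h}{4\pi i\mu}\int\left(\bar\psi\frac{\partial\psi}{\partial x} - \psi\frac{\partial\bar\psi}{\partial x}\right)dx,$$

which is identical with equation (66). 

5 Instead of the coefficient h/(4πiμ) we can evidently use the coefficient −ih/(4πμ). 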
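It follows that the average value of the momentum is given by the 
relation 

$$\bar{p}_x = \mu\frac{d\bar{x}}{dt} = \frac{ih}{2\pi}\int\psi\frac{\partial\bar\psi}{\partial x}\,dx \qquad (69a)$$

$$\qquad\qquad\;\; = \frac{h}{2\pi i}\int\bar\psi\frac{\partial\psi}{\partial x}\,dx. \qquad (69b)$$

The last equation is one which follows, as shown in section 2.7, from 
the operator method for deriving the S. equation, since 

$$\bar{p}_x = \int\bar\psi\left(\frac{h}{2\pi i}\frac{\partial}{\partial x}\right)\psi\,dx.$$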

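From equations (69a), (61), and (62) it follows that 

$$\frac{d\bar{p}_x}{dt} = \frac{ih}{2\pi}\frac{d}{dt}\int\psi\frac{\partial\bar\psi}{\partial x}\,dx
= \frac{ih}{2\pi}\int\left[\frac{\partial\psi}{\partial t}\frac{\partial\bar\psi}{\partial x} + \psi\frac{\partial^2\bar\psi}{\partial t\,\partial x}\right]dx$$

$$= \frac{ih}{2\pi}\int\left[\frac{\partial\psi}{\partial t}\frac{\partial\bar\psi}{\partial x} - \frac{\partial\psi}{\partial x}\frac{\partial\bar\psi}{\partial t}\right]dx$$

$$= \frac{ih}{2\pi}\cdot\frac{ih}{4\pi\mu}\int\left[\left(\frac{\partial^2\psi}{\partial x^2} - aV\psi\right)\frac{\partial\bar\psi}{\partial x} + \frac{\partial\psi}{\partial x}\left(\frac{\partial^2\bar\psi}{\partial x^2} - aV\bar\psi\right)\right]dx.$$

Now, 

$$\frac{\partial^2\psi}{\partial x^2}\frac{\partial\bar\psi}{\partial x} + \frac{\partial\psi}{\partial x}\frac{\partial^2\bar\psi}{\partial x^2} = \frac{\partial}{\partial x}\left(\frac{\partial\psi}{\partial x}\frac{\partial\bar\psi}{\partial x}\right), \qquad \psi\frac{\partial\bar\psi}{\partial x} + \bar\psi\frac{\partial\psi}{\partial x} = \frac{\partial(\psi\bar\psi)}{\partial x}.$$

Also, $(\partial\psi/\partial x)(\partial\bar\psi/\partial x)$ vanishes at the limits. Hence, 

$$\frac{d\bar{p}_x}{dt} = \frac{h^2a}{8\pi^2\mu}\int V\frac{\partial(\psi\bar\psi)}{\partial x}\,dx = -\int\psi\bar\psi\,\frac{\partial V}{\partial x}\,dx.$$

That is, 

$$\mu\frac{d^2\bar{x}}{dt^2} = -\overline{\left(\frac{\partial V}{\partial x}\right)}, \qquad (70)$$

where the dash over the right-hand bracket indicates that the average 
value of ∂V/∂x is to be used. Evidently, equation (70) corresponds to 
Newton's second law of motion, and it has thus been shown that the 
relations of quantum mechanics are in agreement, for extremely narrow 
wave packets, with those deduced in classical mechanics. 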

15.8 Spreading out of Wave Packet in Time. It is of interest, in 
order to illustrate Heisenberg's principle, to consider the inferences that 
may be deduced from this principle regarding the variation with time, 
in the form of a wave packet. Equation (25) shows that for a particle 
in a zero field, confined to move along the x-coordinate, 

$$\frac{\partial\psi}{\partial t} = D\frac{\partial^2\psi}{\partial x^2}, \qquad (71)$$

where 

$$D = \frac{ih}{4\pi\mu}. \qquad (72)$$

The solution of this differential equation gives ψ = ψ(x, t) and 
therefore makes it possible to determine the behavior of ψ as t is increased 
from t = 0. 

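For real values of D, equation (71) is the well-known partial 
differential equation for diffusion, and the simplest form of solution is that 
given by the expression⁶ 

$$\psi = \frac{C}{\sqrt{2D(t+\tau)}}\;e^{-x^2/4D(t+\tau)}, \qquad (73)$$

where τ and C are constants. If we designate the exponential factor 
by φ, we obtain the following relations: 

$$\frac{\partial\psi}{\partial t} = \frac{C\phi}{\sqrt{2D(t+\tau)}}\left[\frac{x^2}{4D(t+\tau)^2} - \frac{1}{2(t+\tau)}\right],$$

$$\frac{\partial\psi}{\partial x} = -\frac{C\phi}{\sqrt{2D(t+\tau)}}\cdot\frac{x}{2D(t+\tau)},$$

$$\frac{\partial^2\psi}{\partial x^2} = \frac{C\phi}{\sqrt{2D(t+\tau)}}\left[\frac{x^2}{4D^2(t+\tau)^2} - \frac{1}{2D(t+\tau)}\right],$$

and it is seen that the solution in (73) satisfies the differential equation 
(71). 

For the case in which D is real, it is found that this solution 
corresponds to the diffusion of C particles which originally (t = −τ) were 
concentrated in the plane x = 0. 

6 Byerly, "Fourier's Series," pp. 93-95. 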

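In the present case, let us define 2Dτ as a real positive magnitude by 
the relation 2Dτ = a². Then we can write equation (73) in the form 

$$\psi = \frac{C}{\sqrt{a^2 + iht/2\pi\mu}}\;\exp\left[-\frac{x^2}{2\,(a^2 + iht/2\pi\mu)}\right].$$

Also, 

$$\bar\psi = \frac{C}{\sqrt{a^2 - iht/2\pi\mu}}\;\exp\left[-\frac{x^2}{2\,(a^2 - iht/2\pi\mu)}\right].$$

Hence, 

$$\psi\bar\psi = \frac{C^2}{a\sqrt{a^2 + (ht/2\pi\mu a)^2}}\;\exp\left[-\frac{x^2}{a^2 + (ht/2\pi\mu a)^2}\right]. \qquad (74)$$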



Evidently, $\psi\bar\psi$ represents the same type of Gauss error function as that 
defined in equation (54), in which 1/c² is replaced by 

$$a_t^2 = a^2 + \left(\frac{ht}{2\pi\mu a}\right)^2. \qquad (75)$$

For t = 0, the value of $\psi\bar\psi$ for x = 0 is given by C²/a². With 
increase in the absolute value of t, $a_t$ increases, with the result that the 
maximum value of $\psi\bar\psi$ decreases and the packet spreads out more 
and more. This may also be described as follows: 

The probability of occurrence of the particle in the range $-x_0 \le x \le x_0$ 
is given by the integral 

$$P = \int_{-x_0}^{x_0}\psi\bar\psi\,dx.$$

Equation (74) shows that, with increase in |t|, the value of P for the 
same limits decreases. Furthermore, since $\psi\bar\psi$ is symmetrical with 
respect to t = 0, it follows that this increasing uncertainty in the value 
of P extends to both the past and future. That is, no matter how exactly 
the value of P may be defined for a narrow range at t = 0, the gradual 
spreading out of the packet with time will result in a correspondingly 
increasing uncertainty in the value of P. 
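
The rate of spreading is readily put into figures. The sketch below evaluates the width of relation (75) for an electron and the fraction of the packet lying between −x₀ and +x₀, which for the Gaussian form (74), normalized to unity, is erf(x₀/a_t). The initial width of 10⁻⁸ cm, the times, and the function names are assumptions made for the example.

```python
import math

H = 6.547e-27    # Planck's constant, erg sec (Appendix II)
M0 = 9.035e-28   # mass of electron, g (Appendix II)


def width(a, t, mu=M0):
    """Width a_t of the packet at time t, relation (75)."""
    return math.sqrt(a * a + (H * t / (2 * math.pi * mu * a)) ** 2)


def probability(x0, a, t, mu=M0):
    """Fraction of the packet in -x0..x0, from (74) normalized to unity."""
    return math.erf(x0 / width(a, t, mu))


if __name__ == "__main__":
    a = 1.0e-8   # cm, assumed initial width
    for t in (0.0, 1.0e-16, 1.0e-15, 1.0e-14):
        print("t =", t, "sec  a_t =", width(a, t), "cm  P =", probability(a, a, t))
```

For an electron confined initially to atomic dimensions the width is already doubled in a time of the order of 10⁻¹⁶ sec.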

While this argument has been applied in the previous paragraphs to 
the case in which the average value of x remains zero, a similar argument 
applies to the case of motion in a field of force. For an instructive illus- 
tration of such a calculation the reader should consult the discussion 
by C. G. Darwin 7 in which he has demonstrated that the law of motion 
of a packet in a gravitational field, as deduced by the methods of quan- 
tum mechanics, is identical with that deduced in ordinary mechanics. 

COLLATERAL READING 

1. The Correspondence Principle. For further discussion consult Ruark and 
Urey, "Atoms, Molecules and Quanta," Chapter VI; Slater and Frank, "Theoretical 
Physics," Chapter XXX. 

2. Wave Mechanics Theory of Radiation. Condon and Morse, Chapter III; 
Ruark and Urey, pp. 542-47; J. Frenkel, "Wave Mechanics," Vol. 1, p. 124, et seq. 

3. Intensities of Spectral Lines. A very comprehensive discussion is given by 
H. Bethe, "Handbuch der Physik," XXIV/1, p. 429 et seq. 
Akademische Verlagsgesellschaft m.b.H., Leipzig, 1930. 

7 Proc. Roy. Soc., A117, 258 (1928). See also E. H. Kennard, J. Franklin 
Institute, 207, 47 (1929). 

APPENDIX II 
VALUES OF PHYSICAL CONSTANTS¹ 

Molar volume of ideal gas, at 0° C. and 760 mm. Hg (= $1.0132 \times 10^{6}$ dyne cm$^{-2}$), 
    $V_M = 22{,}414$ cm$^{3}$. 
Molar gas constant, 
    $R = 8.3136 \times 10^{7}$ erg deg$^{-1}$ mole$^{-1}$ 
    $\;\; = 8.2046 \times 10^{-2}$ liter atmos deg$^{-1}$ mole$^{-1}$ 
    $\;\; = 1.9864$ cal$_{15}$ deg$^{-1}$ mole$^{-1}$. 
Faraday constant, $F = 96{,}490$ abs-coul g-equiv$^{-1}$. 
Velocity of light, $c = 2.99796 \times 10^{10}$ cm sec$^{-1}$. 
Electronic charge, $e = 4.770 \times 10^{-10}$ abs-es-units; 
    $e/c = 1.591 \times 10^{-20}$ abs-em-units. 
Specific electronic charge, $e/m_0 = 1.761 \times 10^{7}$ abs-em-unit g$^{-1}$. 
Mass of electron, $m_0 = 9.035 \times 10^{-28}$ g. 
Planck's constant, $h = 6.547 \times 10^{-27}$ erg sec. 
Avogadro's number, $N_0 = 6.064 \times 10^{23}$ mole$^{-1}$. 
Boltzmann constant, $k = R/N_0 = 1.3709 \times 10^{-16}$ erg deg$^{-1}$. 
Kinetic energy per molecule at 0° C., 
    $E = \tfrac{3}{2}k(273.18) = 5.617 \times 10^{-14}$ erg. 
Atomic specific heat constant, $\beta = h/k = 4.7757 \times 10^{-11}$ sec deg. 
Mass of hydrogen atom, $m_H = 1.6618 \times 10^{-24}$ g. 
Einstein's constant, $h/e = 1.3725 \times 10^{-17}$ erg sec es-unit$^{-1}$. 
Energy of V-abs-volt-electron, 
    $h\nu = \tfrac{1}{2}m_0v^2 = Ve = 1.591 \times 10^{-12}$ erg $\times\, V$. 
Wave number associated with V-abs-volt-electron, 
    $\nu_0 = 8106$ cm$^{-1}$ $\times\, V$. 
Wave length associated with V-abs-volt-electron, 
    $\lambda_0 = 1/\nu_0 = 12{,}336 \times 10^{-8}$ cm $V^{-1}$. 
Mechanical equivalent of heat, 
    $J_{15} = 4.1852$ abs-joule cal$_{15}^{-1}$. 
Energy per mole for 1 abs-volt-electron per molecule, 
    $F/J_{15} = 23{,}055$ cal$_{15}$ mole$^{-1}$. 
Radius of Bohr orbit, 
    $h^2/(4\pi^2 m_0 e^2) = 0.5282 \times 10^{-8}$ cm. 
Bohr unit of angular momentum, 
    $h/2\pi = 1.042 \times 10^{-27}$ erg sec. 
Magnetic moment of 1 Bohr magneton, 
    $he/(4\pi m_0 c) = 9.175 \times 10^{-21}$ erg gauss$^{-1}$. 
Rydberg constant for infinite nuclear mass, 
    $R_\infty = 109{,}737.42$ cm$^{-1}$. 
Rydberg constant for hydrogen, $R_H = 109{,}677.76$ cm$^{-1}$. 
Ionization potential for H atom, 
    $R_H c h = 2\pi^2 e^4 m/h^2 = 13.530$ abs-volt. 
Schroedinger constant for electron, 
    $8\pi^2 m_0/h^2 = 1.664 \times 10^{27}$ g erg$^{-2}$ sec$^{-2}$. 
Schroedinger constant for H atom, 
    $8\pi^2 m_H/h^2 = 3.062 \times 10^{30}$ g erg$^{-2}$ sec$^{-2}$. 
De Broglie wave length associated with V-abs-volt-electron, 
    $\lambda = h/(m_0 v) = h/\sqrt{2m_0Ve} = 12.21 \times 10^{-8}$ cm $V^{-1/2}$. 

1 These values are taken from the publications by R. T. Birge, Phys. Rev. Suppl., 
1, 1 (1929), and Phys. Rev., 40, 228 (1932). 

APPENDIX III 
SPECIAL TABLES OF MATHEMATICAL FORMULAS 

The following tables contain a number of relations which are of special interest in 
quantum mechanics calculations. For more complete tables the reader should con- 
sult the references given in the last section. 

1. Series Expansions. On the basis of the binomial theorem it is shown that 
for positive integral values of n 

$$(x+y)^n = x^n + nx^{n-1}y + \frac{n(n-1)}{2!}x^{n-2}y^2 + \ldots + \frac{n(n-1)\ldots(n-k+1)}{k!}x^{n-k}y^k + \ldots + nxy^{n-1} + y^n.$$

For fractional and negative values of n the series converges for |y/x| < 1. If 
n > 0, the series also converges for |y/x| = 1. 
Special cases: 



' 28 16 128 256 1024 

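$$e = \lim_{n\to\infty}\left(1 + \frac{1}{n}\right)^n = 1 + \frac{1}{1!} + \frac{1}{2!} + \frac{1}{3!} + \ldots = 2.718282.$$

$$e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \ldots$$

$$b^x = e^{ax}, \text{ where } a = \log_e b = \ln b = 2.30259\,\log_{10} b.$$

$$\sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \ldots$$

$$\cos x = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \ldots$$

$$\sinh x = -i\sin(ix) = x + \frac{x^3}{3!} + \frac{x^5}{5!} + \ldots$$

$$\cosh x = \cos(ix) = 1 + \frac{x^2}{2!} + \frac{x^4}{4!} + \ldots$$

π = 3.141593. 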

2. Algebraic Equations. (1) The two roots of the quadratic equation 

$$ax^2 + bx + c = 0$$

are given by the relation 

$$x = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}.$$

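(2) Given the system of n equations with n unknowns, 

$$a_{11}x_1 + a_{12}x_2 + \ldots + a_{1n}x_n = c_1$$
$$a_{21}x_1 + a_{22}x_2 + \ldots + a_{2n}x_n = c_2$$
$$\ldots$$
$$a_{n1}x_1 + a_{n2}x_2 + \ldots + a_{nn}x_n = c_n$$

the solution is given by¹ 

$$x_k = \frac{1}{D}\sum_{i=1}^{n} c_i D_{ik},$$

where 

$$D = \begin{vmatrix} a_{11} & a_{12} & \ldots & a_{1n} \\ a_{21} & a_{22} & \ldots & a_{2n} \\ \ldots & & & \\ a_{n1} & a_{n2} & \ldots & a_{nn} \end{vmatrix}$$

and $D_{ik}$ is a determinant of order n − 1 which is obtained thus: 
$D_{ik} = (-1)^{i+k}$ × det. obtained by omitting from D the ith row and kth column. 
Hence 

$$x_1 = \frac{c_1D_{11} + c_2D_{21} + \ldots + c_nD_{n1}}{D}, \text{ etc.}$$

For $c_1 = c_2 = \ldots = c_n = 0$ (homogeneous system of equations), $x_1 = x_2 = \ldots = x_n = 0$ 
if D ≠ 0. For D = 0, the system of equations determines the ratios of the 
unknowns. That is, 

$$x_1 : x_2 : \ldots : x_n = D_{11} : D_{12} : \ldots : D_{1n}.$$

1 This solution is taken from L. Silberstein's Mathematical Tables. 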
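
The solution by cofactors may be written out as a short routine. The following is a sketch only: the determinant is expanded recursively along the first row, which is adequate for the small systems met with in this text but not for large n, and the function names are illustrative.

```python
def minor(a, i, k):
    """Matrix a with row i and column k omitted."""
    return [row[:k] + row[k + 1:] for r, row in enumerate(a) if r != i]


def det(a):
    """Determinant by expansion along the first row (small matrices only)."""
    n = len(a)
    if n == 0:
        return 1
    if n == 1:
        return a[0][0]
    return sum((-1) ** k * a[0][k] * det(minor(a, 0, k)) for k in range(n))


def cofactor(a, i, k):
    """D_ik = (-1)^(i+k) times the determinant left after omitting row i, column k."""
    return (-1) ** (i + k) * det(minor(a, i, k))


def solve(a, c):
    """x_k = (1/D) * sum over i of c_i D_ik; requires D different from zero."""
    d = det(a)
    if d == 0:
        raise ValueError("D = 0: only the ratios of the unknowns are determined")
    n = len(a)
    return [sum(c[i] * cofactor(a, i, k) for i in range(n)) / d for k in range(n)]


if __name__ == "__main__":
    a = [[1, 1, 1], [0, 2, 5], [2, 5, -1]]
    c = [6, -4, 27]
    print(solve(a, c))
```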
3. Exponential, Trigonometric, and Hyperbolic Functions. 

$$\sin(x \pm y) = \sin x\cos y \pm \cos x\sin y.$$
$$\cos(x \pm y) = \cos x\cos y \mp \sin x\sin y.$$
$$2i\sin ax = e^{iax} - e^{-iax} \quad (\text{where } i = \sqrt{-1}).$$
$$2\cos ax = e^{iax} + e^{-iax}.$$
$$e^{iax} = \cos ax + i\sin ax.$$
$$e^{-iax} = \cos ax - i\sin ax.$$

$$\sinh ax = \tfrac{1}{2}\left(e^{ax} - e^{-ax}\right) = ax + \frac{a^3x^3}{3!} + \frac{a^5x^5}{5!} + \ldots$$

$$\cosh ax = \tfrac{1}{2}\left(e^{ax} + e^{-ax}\right) = 1 + \frac{a^2x^2}{2!} + \frac{a^4x^4}{4!} + \ldots$$

$$e^{ax} = \cosh ax + \sinh ax.$$
$$e^{-ax} = \cosh ax - \sinh ax.$$

$$\tanh ax = \frac{\sinh ax}{\cosh ax} = \frac{1 - e^{-2ax}}{1 + e^{-2ax}}.$$

$$\cosh^2 ax - \sinh^2 ax = 1.$$

DeMoivre's formula: For integral, fractional, and negative values of n, 

$$(\cos\theta + i\sin\theta)^n = \cos n\theta + i\sin n\theta.$$

Given 

$$|z|^2 = x^2 + y^2 = (x + iy)(x - iy),$$

the complex number (x − iy) is the conjugate of (x + iy). 

$|z| = +\sqrt{x^2 + y^2}$ is known as the modulus, 
$|z|^2$, the norm of z. 

With polar coordinates such that x = r cos θ, y = r sin θ, 

$$r = |z|; \qquad z = r(\cos\theta + i\sin\theta) = re^{i\theta}.$$

The complex conjugate of z is designated by $\bar{z}$, and hence 

$$\bar{z} = r(\cos\theta - i\sin\theta) = re^{-i\theta}.$$
4. Definite Integrals. 

$$\int_0^{\pi/2}\sin^2x\,dx = \int_0^{\pi/2}\cos^2x\,dx = \frac{\pi}{4}.$$

r- 
r ; 



$$\int_0^{\pi/2}\sin^3x\,dx = \int_0^{\pi/2}\cos^3x\,dx = \frac{2}{3}.$$

$$\int_0^{\pi/2}\sin^nx\,dx = \frac{n-1}{n}\int_0^{\pi/2}\sin^{n-2}x\,dx \quad \text{for } n \ge 2.$$

$$\int_0^{\pi/2}\cos^nx\,dx = \frac{n-1}{n}\int_0^{\pi/2}\cos^{n-2}x\,dx \quad \text{for } n \ge 2.$$

$$\int_0^{\pi}\sin mx\,\sin nx\,dx = \int_0^{\pi}\cos mx\,\cos nx\,dx = 0 \quad \text{for } m \ne n.$$

$$\int_0^{\pi/2}\sin^mx\,\cos^nx\,dx = \frac{1\cdot 3\cdot 5\ldots(m-1)\cdot 1\cdot 3\cdot 5\ldots(n-1)}{2\cdot 4\cdot 6\ldots(m+n)}\cdot\frac{\pi}{2},$$

where m and n are both even integers, 

$$\qquad = \frac{2\cdot 4\cdot 6\ldots(m-1)}{(n+1)(n+3)\ldots(m+n)},$$

where m is odd. 
The error function is defined by the integral (that of Gauss) 

$$\mathrm{erf}\,x = \frac{2}{\sqrt{\pi}}\int_0^x e^{-x^2}\,dx.$$

The exponential integral 

rv 
du. 
u 

Euler's constant 

/*< /fr / 1 1 \ 

-i. ] = 0.5772. 



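Evaluation of integrals of the form $J_n = \int x^ne^{-x}\,dx$. 

Integration by parts gives 

$$\int_x^{\infty}x^ne^{-x}\,dx = x^ne^{-x} + n\int_x^{\infty}x^{n-1}e^{-x}\,dx.$$

That is, the value of $J_n$ is expressed in terms of $J_{n-1}$. Proceeding in this manner 
it is shown that 

$$\int_x^{\infty}x^ne^{-x}\,dx = n!\,e^{-x}\left(1 + x + \frac{x^2}{2!} + \ldots + \frac{x^n}{n!}\right).$$

Consequently, 

$$\int_0^{\infty}x^ne^{-x}\,dx = n!,$$

and 

$$\int_0^{x}x^ne^{-x}\,dx = n!\left[1 - e^{-x}\left(1 + x + \frac{x^2}{2!} + \ldots + \frac{x^n}{n!}\right)\right].$$

From these relations the following integrals are derived: 

$$\int_0^{\infty}e^{-x}\,dx = 1; \qquad \int_x^{\infty}e^{-x}\,dx = e^{-x}; \qquad \int_0^{x}e^{-x}\,dx = 1 - e^{-x}.$$

$$\int_0^{\infty}xe^{-x}\,dx = 1; \qquad \int_x^{\infty}xe^{-x}\,dx = e^{-x}(1 + x); \qquad \int_0^{x}xe^{-x}\,dx = 1 - e^{-x}(1 + x).$$

$$\int_0^{\infty}x^3e^{-x}\,dx = 6; \qquad \int_x^{\infty}x^3e^{-x}\,dx = 6e^{-x}\left(1 + x + \frac{x^2}{2} + \frac{x^3}{6}\right).$$

Integrals involving $x^ne^{-mx}$ may be reduced to the same form thus: 

$$\int x^ne^{-mx}\,dx = \frac{1}{m^{n+1}}\int y^ne^{-y}\,dy, \quad \text{where } y = mx.$$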
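
The closed form for the integral from 0 to x may be compared with a direct numerical integration. The sketch below uses Simpson's rule with an arbitrarily chosen number of steps; the function names are illustrative.

```python
import math


def closed_form(n, x):
    """n! * [1 - e^(-x) * (1 + x + x^2/2! + ... + x^n/n!)]."""
    series = sum(x ** k / math.factorial(k) for k in range(n + 1))
    return math.factorial(n) * (1.0 - math.exp(-x) * series)


def simpson(n, x, steps=2000):
    """Simpson estimate of the integral of u^n e^(-u) from 0 to x (steps must be even)."""
    h = x / steps
    total = 0.0
    for i in range(steps + 1):
        u = i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * u ** n * math.exp(-u)
    return total * h / 3.0


if __name__ == "__main__":
    for n in (1, 2, 3):
        print("n =", n, closed_form(n, 2.0), simpson(n, 2.0))
```

For large x the closed form approaches n!, in agreement with the integral from 0 to ∞.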



R. Eisenschitz and F. London, Z. Physik, 60, 491 (1930), give a table of 
"Exchange" and Coulomb integrals which occur in the treatment of the problem of 
interaction of two hydrogen atoms. 

The Gamma Function or Euler's Integral of the Second Kind is defined thus: 

$$\Gamma(n) = \int_0^{\infty}x^{n-1}e^{-x}\,dx.$$

For positive integral values of n 

$$\Gamma(n) = \Pi(n-1) = 1\cdot 2\ldots(n-1) = (n-1)!$$

$$\Gamma(1) = \Pi(0) = 1; \qquad \Gamma(0) = \infty.$$

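Integrals which are of special interest in the kinetic theory of gases (see Appendix 
in J. H. Jeans' "The Dynamical Theory of Gases"): 

$$\int_0^{\infty}x^{2n}e^{-\lambda x^2}\,dx = \frac{1\cdot 3\cdot 5\ldots(2n-1)}{2^{n+1}}\sqrt{\frac{\pi}{\lambda^{2n+1}}}$$

$$\int_0^{\infty}x^{2n+1}e^{-\lambda x^2}\,dx = \frac{n!}{2\lambda^{n+1}}$$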
Special cases which are of importance: 



r- 



r m * 1 

3 """SfcT j 
X 8m^ 
5. References for Numerical Values of Functions and Integrals. 

(CH) "Handbook of Chemistry and Physics," Chemical Rubber Publishing Co., 
     Cleveland, Ohio. Annual publication. 
(ST) "Smithsonian Physical Tables," eighth revised edition, edited by F. E. 
     Fowle, Smithsonian Institution, Washington (1933). 
(MS) "Mathematical Tables," L. Silberstein, G. Bell & Sons, Ltd., London, 
     England. 
(MP) "A Short Table of Integrals," B. O. Peirce, Ginn & Co., Boston. 
(JE) "Funktionentafeln mit Formeln und Kurven," E. Jahnke und F. Emde, 
     B. G. Teubner, Leipzig and Berlin, 1933. 

Function Reference for Tables 

CH;ST;MP;JE 



$n^{-1}$, $n^2$, $n^3$, $\sqrt{n}$, $\sqrt[3]{n}$     CH; ST, (n = 1 to 999) 

$n!$, $\log n!$     CH, (n = 1 to 100) 

$n^4$, $n^5$, $n^6$, $n^7$, $n^8$     CH, (n = 1 to 100) 

Erf function MS, MP 

x2 , e-^ ST 

Ei(x); Γ(n) for n = 1 to 2     ST; JE 

Diffusion integral 



Zonal spherical harmonics, ST, JE 

Derived Legendre polynomials, JE. 
APPENDIX IV 
SOME FUNDAMENTAL THEOREMS AND 
DIFFERENTIAL EQUATIONS 

Quantum mechanics is a development which has its basis in classical physics. 
It represents an evolution and not a revolution, as has been imagined by many, 
especially those who have not taken the trouble to study the logic and technic 
of the subject. From this point of view the student of quantum mechanics will find 
it of interest to renew his acquaintanceship with some of the concepts and theorems 
which should be regarded as the foundations of theoretical physics. 

In the following sections an attempt has been made to present a summary, or 
rather digest, of some of the theorems and differential equations which are of a 
fundamental nature and of such general scope that they have found application in 
many diverse fields of physics. 

Many of these theorems may be derived either by direct mathematical argu- 
ment or by application of the concepts of vector analysis. The latter method is the 
more concise, and, though its symbolism may seem artificial, there is a deep physical 
significance in the use of vectors. 

The presentation given in the following sections follows closely the excellent 
summary given by R. E. Doherty and E. G. Keller in " Mathematics of Modern 
Engineering," Vol. I, Chapter III. The bibliography at the end of the chapter 
gives references to a number of other works on the topic of vector analysis. 

1. Vectors and Vector Products. All elementary physical quantities may be 
classified into two groups: vectors and scalars. A scalar is that which has 
magnitude only; a vector has both magnitude and direction. For instance, 
mass, length, and energy are scalars, but velocity, force, and momentum are 
examples of vector quantities. 

A vector is represented, as shown in Fig. 1, by a line drawn in the direction 
in which the magnitude is effective, and an arrowhead is used to indicate the 
sense. To distinguish between them, vectors are usually printed in bold-face 
type, and scalars in italics. 

The vector R has the magnitude r, and the direction indicated by the arrow. 
The convention regarding sign is as follows: The positive direction of the 
z-axis is taken to be that in which a right-handed screw advances as it is 
rotated in the xy-plane from the positive direction of the x-axis to that of 
the y-axis, as indicated in Fig. 1 by the circular arrow in the xy-plane. 

The direction of a vector is usually indicated by the three direction cosines (see 
Fig. 1) 

$$l = \cos(x, r) = \cos\alpha, \qquad m = \cos(y, r) = \cos\beta, \qquad n = \cos(z, r) = \cos\gamma. \qquad (1)$$

Hence, 

$$l^2 + m^2 + n^2 = 1. \qquad (2)$$

FIG. 1. 

If $l_1, m_1, n_1$ and $l_2, m_2, n_2$ are the direction cosines of two vectors R and Q, drawn 
from the same origin, the angle between the two lines is given by 

$$\cos\theta = l_1l_2 + m_1m_2 + n_1n_2. \qquad (3)$$

It follows that if the two vectors are at right angles to each other, 

$$l_1l_2 + m_1m_2 + n_1n_2 = 0. \qquad (4)$$

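A unit vector is one whose magnitude is unity, and the three unit vectors i, j, k 
correspond to vectors of unit length directed along the three rectangular coordinate 
axes x, y, z, respectively, as shown in Fig. 1. Any vector $\mathbf{R}_1$ is therefore described by 
the three component vectors thus: 

$$\mathbf{R}_1 = \mathbf{i}x_1 + \mathbf{j}y_1 + \mathbf{k}z_1, \qquad (5)$$

where $r_1$, the magnitude of the vector $\mathbf{R}_1$, is given by the relation 

$$r_1 = \sqrt{x_1^2 + y_1^2 + z_1^2}.$$

If we have another vector 

$$\mathbf{R}_2 = \mathbf{i}x_2 + \mathbf{j}y_2 + \mathbf{k}z_2$$

making an angle θ with the vector $\mathbf{R}_1$, the resultant vector Q is given by the law of 
parallelogram of velocities 

$$|\mathbf{Q}|^2 = |\mathbf{R}_1|^2 + |\mathbf{R}_2|^2 + 2\,|\mathbf{R}_1||\mathbf{R}_2|\cos\theta, \qquad (6)$$

where the vertical lines indicate that the absolute value of the vector is to be used. 

The scalar or dot product of two vectors is a scalar and is written in the form $\mathbf{R}_1\cdot\mathbf{R}_2$. 
It is defined by the relation 

$$\mathbf{R}_1\cdot\mathbf{R}_2 = (\mathbf{i}x_1 + \mathbf{j}y_1 + \mathbf{k}z_1)\cdot(\mathbf{i}x_2 + \mathbf{j}y_2 + \mathbf{k}z_2). \qquad (7)$$

Since 

$$\mathbf{i}\cdot\mathbf{i} = \mathbf{j}\cdot\mathbf{j} = \mathbf{k}\cdot\mathbf{k} = 1 \quad\text{and}\quad \mathbf{i}\cdot\mathbf{j} = \mathbf{j}\cdot\mathbf{k} = \mathbf{k}\cdot\mathbf{i} = 0, \qquad (8)$$

it follows that 

$$\mathbf{R}_1\cdot\mathbf{R}_2 = \mathbf{R}_2\cdot\mathbf{R}_1 = x_1x_2 + y_1y_2 + z_1z_2. \qquad (9)$$

That is, the order of "multiplication" is unimportant. 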

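The vector or cross product of two vectors is a vector 
perpendicular to the plane containing the two vectors 
and is regarded as positive if the product is taken in 
the same direction as that of the rotation of a right-handed 
screw. Thus, in Fig. 2 

$$\mathbf{Q} = \mathbf{R}_1\times\mathbf{R}_2 = -\mathbf{R}_2\times\mathbf{R}_1.$$

That is, the order of "multiplication" is significant, 
and the product is said to be non-commutative. The 
absolute magnitude of the vector Q is equal to the 
area of the parallelogram of which $\mathbf{R}_1$ and $\mathbf{R}_2$ are the 
sides. That is, 

$$|\mathbf{Q}| = |\mathbf{R}_1||\mathbf{R}_2|\sin\theta. \qquad (10)$$

It follows that 

$$\mathbf{i}\times\mathbf{i} = \mathbf{j}\times\mathbf{j} = \mathbf{k}\times\mathbf{k} = 0,$$
$$\mathbf{j}\times\mathbf{k} = \mathbf{i} = -\mathbf{k}\times\mathbf{j},$$
$$\mathbf{k}\times\mathbf{i} = \mathbf{j} = -\mathbf{i}\times\mathbf{k},$$
$$\mathbf{i}\times\mathbf{j} = \mathbf{k} = -\mathbf{j}\times\mathbf{i}. \qquad (11)$$

It is important to note the cyclical order of the factors in the last three equations. 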
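
The rules for the two products may be tried out on the unit vectors. The sketch below represents a vector by its three rectangular components; the function names are illustrative.

```python
def dot(a, b):
    """Scalar product: x1*x2 + y1*y2 + z1*z2."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def cross(a, b):
    """Vector product in a right-handed system of axes."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


I, J, K = (1, 0, 0), (0, 1, 0), (0, 0, 1)

if __name__ == "__main__":
    print("i x j =", cross(I, J), "  j x i =", cross(J, I))
    print("j x k =", cross(J, K), "  k x i =", cross(K, I))
    r1, r2 = (1.0, 2.0, 3.0), (-2.0, 0.5, 4.0)
    q = cross(r1, r2)
    # |R1 x R2|^2 + (R1 . R2)^2 = |R1|^2 |R2|^2, since sin^2 + cos^2 = 1
    print(dot(q, q) + dot(r1, r2) ** 2, dot(r1, r1) * dot(r2, r2))
```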
2. Line and Surface Integrals of Vectors. (a) Line Integrals. The work done 
by a force F on a particle in moving a distance dr is given by F cos θ dr, where θ is 
the angle between the direction of F and that of dr. (In Fig. 3 dr is directed along 
the tangent to the curve.) In vector notation, work done in traversing the distance 
AB along the curve is given by 

$$\int_A^B\mathbf{F}\cdot d\mathbf{r} = \int_A^B F\cos\theta\,dr = \int_A^B(\mathbf{i}F_x + \mathbf{j}F_y + \mathbf{k}F_z)\cdot(\mathbf{i}\,dx + \mathbf{j}\,dy + \mathbf{k}\,dz) = \int_A^B(F_x\,dx + F_y\,dy + F_z\,dz). \qquad (12)$$

FIG. 3. 

FIG. 4. 



(b) Surface Integrals. Consider a region in which there is a potential energy 
function. Let S designate a closed surface drawn in this region (see Fig. 4), and 
consider a prism P, in this region, of cross section dy dz, the sides of which are parallel 
to the OX-axis. Let $dS_1$ and $dS_2$ denote the intercepts of this prism on the surface 
S, and let $l_1, m_1, n_1$ and $l_2, m_2, n_2$ denote the direction cosines of the normal to these 
two elements of surface respectively. 

Then $dy\,dz = l_1\,dS_1 = -l_2\,dS_2$ and $(F_1 - F_2)\,dy\,dz = F_1l_1\,dS_1 + F_2l_2\,dS_2$ denotes the net 
flux through the prism P in the outward direction, where $F_1$ and $F_2$ designate the 
values of $F_x$ at $x_1, y, z$ and $x_2, y, z$ respectively. 

More generally, if n designates the unit vector along the normal to any element 
dS of the surface, this element can be represented by a vector 

$$d\mathbf{S} = \mathbf{n}\,dS, \qquad (13)$$

and the integral 

$$\int_S\mathbf{F}\cdot\mathbf{n}\,dS = \int_S\mathbf{F}\cdot d\mathbf{S} \qquad (14)$$

is the surface integral of F over the surface S. 

The surface integral thus defined evidently corresponds to the total flux of F 
through the surface S. If F is the intensity of heat flow (e.g., in calories) per unit 
time, per unit area perpendicular to the direction of flow, the surface integral repre- 
sents the total heat flow through the surface per unit time. 

Similarly, F may represent a velocity vector, e.g., the mass of fluid crossing unit 
area per unit time. Then the surface integral represents the total flow through the 
surface, per unit time. 
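3. Derivative of a Vector. Let us consider the vector r of magnitude r, where 

$$\mathbf{r} = \mathbf{i}x + \mathbf{j}y + \mathbf{k}z \qquad (15)$$

and x, y, and z are each functions of t. 

Then $\dot{r}$ defines the magnitude of a vector $\dot{\mathbf{r}}$, and we can express this vector in 
terms of the components thus: 

$$\dot{\mathbf{r}} = \frac{d\mathbf{r}}{dt} = \mathbf{i}\frac{dx}{dt} + \mathbf{j}\frac{dy}{dt} + \mathbf{k}\frac{dz}{dt}. \qquad (16)$$

Similarly, we can express the acceleration in the direction of r as a vector by the 
relation 

$$\ddot{\mathbf{r}} = \frac{d^2\mathbf{r}}{dt^2} = \mathbf{i}\frac{d^2x}{dt^2} + \mathbf{j}\frac{d^2y}{dt^2} + \mathbf{k}\frac{d^2z}{dt^2}. \qquad (17)$$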



It is readily shown that 












(18) 



at 



and that if R = R(x, y, z), then 

$$\int\mathbf{R}\,d\tau = \mathbf{i}\int R_x\,d\tau + \mathbf{j}\int R_y\,d\tau + \mathbf{k}\int R_z\,d\tau,$$



(19) 



(20) 



where dτ = dx dy dz. 

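4. The Gradient. Let V(x, y, z) be a scalar point function. For instance, V 
may designate the potential energy function, or the temperature at any point in a 
given region. It is assumed that V is a continuous and singly valued function of 
the coordinates. 

For any value V = C, a constant, this function will be represented by a surface, 
designated an equipotential surface. Let us consider two adjacent surfaces defined 
by the equations V = C and V + ΔV = C + ΔC (see Fig. 5). The derivative of V 
along the normal is dV/dn, and this is greater than the rate of change of V in any 
other direction, dV/dr, since dn is less than dr. 
Evidently dV/dn is a vector, and it is known as the gradient of V. This is usually 
designated thus: 

$$\mathrm{grad}\,V = \mathbf{n}\frac{\partial V}{\partial n}, \qquad (21)$$

where n is a unit vector in the direction of the normal. 

FIG. 5. 

Since 

$$\mathbf{n}\frac{\partial V}{\partial n} = \mathbf{i}\frac{\partial V}{\partial x} + \mathbf{j}\frac{\partial V}{\partial y} + \mathbf{k}\frac{\partial V}{\partial z},$$

it follows that we can also write 

$$\mathrm{grad}\,V = \nabla V, \qquad (22)$$

where the symbol ∇, called "del," is a vector operator defined by the relation 

$$\nabla = \mathbf{i}\frac{\partial}{\partial x} + \mathbf{j}\frac{\partial}{\partial y} + \mathbf{k}\frac{\partial}{\partial z}. \qquad (23)$$

From this we deduce the operator 

$$\nabla^2 = \nabla\cdot\nabla = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}, \qquad (24)$$

which is designated "del squared" and is also known as the Laplacian. 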

As an application of the concept of gradient let us consider the flow of heat through
a medium. The flow occurs in the direction of greatest rate of decrease of temperature,
and is given by the vector relation

$$\mathbf{q} = -k\,\nabla T, \tag{25}$$

where $\mathbf{q}$ is the heat flux, $\nabla T$ is the gradient of temperature, and $k$ is defined as the
coefficient of heat conductivity.

It will be observed that the product of the vector operator $\nabla$ and a scalar
function is a vector, which is designated the gradient of the scalar function.
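
The statement that the rate of change of $V$ is greatest along the normal can be checked numerically. The sketch below is illustrative only: the scalar function `V`, the point `p`, and the helper names are assumptions made for the example, and the derivatives are approximated by central differences.

```python
import math

def gradient(V, point, h=1e-5):
    """Central-difference estimate of (dV/dx, dV/dy, dV/dz) at point."""
    grad = []
    for axis in range(3):
        forward = list(point)
        backward = list(point)
        forward[axis] += h
        backward[axis] -= h
        grad.append((V(*forward) - V(*backward)) / (2 * h))
    return tuple(grad)

def directional_derivative(V, point, direction, h=1e-5):
    """Rate of change of V per unit distance along the given direction."""
    norm = math.sqrt(sum(c * c for c in direction))
    unit = [c / norm for c in direction]
    forward = [p + h * c for p, c in zip(point, unit)]
    backward = [p - h * c for p, c in zip(point, unit)]
    return (V(*forward) - V(*backward)) / (2 * h)

def V(x, y, z):
    # Assumed example potential; its equipotential surfaces are ellipsoids.
    return x ** 2 + 2 * y ** 2 + 3 * z ** 2

p = (1.0, 1.0, 1.0)
g = gradient(V, p)                                   # close to (2, 4, 6)
g_magnitude = math.sqrt(sum(c * c for c in g))
along_normal = directional_derivative(V, p, g)       # direction of grad V
along_x = directional_derivative(V, p, (1.0, 0.0, 0.0))
print(g, g_magnitude, along_normal, along_x)
```

The derivative taken along the direction of the gradient reproduces the magnitude of the gradient, while the derivative along any other direction, such as the x-axis here, is smaller.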

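5. Divergence. The result of operating with $\nabla\cdot$ on a vector function $\mathbf{F}(x, y, z)$
is a scalar function which is designated the divergence of $\mathbf{F}$. From equation (23)
we have the relation

$$\nabla\cdot\mathbf{F} = \left(\mathbf{i}\frac{\partial}{\partial x} + \mathbf{j}\frac{\partial}{\partial y} + \mathbf{k}\frac{\partial}{\partial z}\right)\cdot\mathbf{F}. \tag{26}$$

Let $F_x$, $F_y$, and $F_z$ denote the rectangular components of $\mathbf{F}$. Then

$$\mathbf{F} = \mathbf{i}F_x + \mathbf{j}F_y + \mathbf{k}F_z.$$

Hence,

$$\nabla\cdot\mathbf{F} = \left(\mathbf{i}\frac{\partial}{\partial x} + \mathbf{j}\frac{\partial}{\partial y} + \mathbf{k}\frac{\partial}{\partial z}\right)\cdot(\mathbf{i}F_x + \mathbf{j}F_y + \mathbf{k}F_z) = \frac{\partial F_x}{\partial x} + \frac{\partial F_y}{\partial y} + \frac{\partial F_z}{\partial z} = \operatorname{div}\mathbf{F}. \tag{27}$$

If we regard $\mathbf{F}$ as a flux, that is, number of lines per unit area, the term divergence
means the number of lines which spread out per unit volume. This interpretation is
illustrated best by the application of the concept in the derivation of the equation
of continuity.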
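
As a small numerical illustration of this interpretation, the sketch below estimates the divergence by central differences for two assumed fields: one whose lines spread out uniformly from the origin, and one whose lines merely circulate. The function names are hypothetical and serve only this example.

```python
def divergence(F, point, h=1e-5):
    """Central-difference estimate of dFx/dx + dFy/dy + dFz/dz at point."""
    total = 0.0
    for axis in range(3):
        forward = list(point)
        backward = list(point)
        forward[axis] += h
        backward[axis] -= h
        total += (F(*forward)[axis] - F(*backward)[axis]) / (2 * h)
    return total

def spreading_field(x, y, z):
    # Lines spread out radially from the origin; the divergence is 3 everywhere.
    return (x, y, z)

def circulating_field(x, y, z):
    # Lines circle about the z-axis without spreading; the divergence is 0.
    return (-y, x, 0.0)

point = (0.3, -1.2, 2.0)
div_spreading = divergence(spreading_field, point)
div_circulating = divergence(circulating_field, point)
print(div_spreading, div_circulating)
```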

6. The Equation of Continuity. Let us consider a rectangular element of volume
$dx\,dy\,dz = d\tau$, at $x, y, z$, in a fluid of density $\rho = \rho(x, y, z)$. Let $\mathbf{v}$ denote the velocity
of the fluid at the given point. The mass of fluid $\mathbf{M}$, flowing through a unit area, per
unit time, is given by the mass of a prism of fluid of unit area and length $v$. That is,

$$\mathbf{M} = \rho\mathbf{v}, \tag{28}$$

where $\mathbf{M}$ and $\mathbf{v}$ are vectors. Let $M_x$, $M_y$, and $M_z$ denote the rectangular
components of the vector $\mathbf{M}$.

The rate at which fluid enters the area $dy\,dz$ in the positive direction of the x-axis,
at $x, y, z$, is

$$\rho v_x\,dy\,dz = M_x\,dy\,dz.$$

The rate at which fluid leaves the opposite area at $x + dx, y, z$ is

$$dy\,dz\left(M_x + \frac{\partial M_x}{\partial x}\,dx\right).$$

Hence the net increase in the mass of fluid inside the element $d\tau$, due to the vector
component $M_x$, is

$$-\frac{\partial M_x}{\partial x}\,dx\,dy\,dz.$$

Adding the net increments per unit time due to the inflow and outflow through the
other four sides of the element $d\tau$, we obtain the relation

$$\frac{\partial\rho}{\partial t}\,d\tau = -\left(\frac{\partial M_x}{\partial x} + \frac{\partial M_y}{\partial y} + \frac{\partial M_z}{\partial z}\right)d\tau. \tag{29}$$

That is,

$$\nabla\cdot(\rho\mathbf{v}) = \nabla\cdot\mathbf{M} = \operatorname{div}\mathbf{M} = -\frac{\partial\rho}{\partial t}. \tag{30a}$$

This is known as the equation of continuity. If $\rho$ is constant throughout the fluid,
this equation assumes the form

$$\rho\,\nabla\cdot\mathbf{v} + \frac{\partial\rho}{\partial t} = 0. \tag{30b}$$

If the fluid is incompressible, $\partial\rho/\partial t = 0$, and hence

$$\nabla\cdot\mathbf{M} = 0.$$

"The name divergence," as L. Page remarks, "originated in this interpretation of
$\nabla\cdot\mathbf{M}$. For since $-\nabla\cdot\mathbf{M}$ represents the excess of the inward over the outward
flow, or the convergence of the fluid, so $\nabla\cdot\mathbf{M}$ represents the excess of the outward
over the inward flow, or the divergence of the fluid." 1

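7. Gauss' Theorem. Let $X$, $Y$, $Z$ be the components of a vector $\mathbf{F}$, a continuous
function of the coordinates, and let $S$ designate a closed surface enclosing a region of
volume $V$.

Gauss' theorem states that

$$\int_V \left(\frac{\partial X}{\partial x} + \frac{\partial Y}{\partial y} + \frac{\partial Z}{\partial z}\right) d\tau = \int_S (lX + mY + nZ)\,dS, \tag{31}$$

where $l$, $m$, $n$ are the direction cosines of the normal drawn outwards to the surface
element $dS$.
Since

$$\int_{x_2}^{x_1} \frac{\partial X}{\partial x}\,dx = X_1 - X_2,$$

it follows that

$$\iiint \frac{\partial X}{\partial x}\,dx\,dy\,dz = \iint (X_1 - X_2)\,dy\,dz,$$

where $X_1$ and $X_2$ are the values of $X$ at the two elements $dS_1$ and $dS_2$ in which the
prism of cross section $dy\,dz$ intersects the surface $S$ (see Fig. 4).
But

$$dy\,dz = l_1\,dS_1 = -l_2\,dS_2.$$

Hence

$$(X_1 - X_2)\,dy\,dz = l_1 X_1\,dS_1 + l_2 X_2\,dS_2.$$

Integrating through the elementary parallel prisms and adding, we obtain the
relation

$$\int_V \frac{\partial X}{\partial x}\,d\tau = \int_S lX\,dS.$$

In a similar manner it is shown that

$$\int_V \frac{\partial Y}{\partial y}\,d\tau = \int_S mY\,dS, \quad\text{and}\quad \int_V \frac{\partial Z}{\partial z}\,d\tau = \int_S nZ\,dS,$$

from which equation (31) follows.

1 "Introduction to Theoretical Physics," p. 22.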

From equation (27) it follows that (31) may be written in the form

$$\int_V \nabla\cdot\mathbf{F}\,d\tau = \int_S (lX + mY + nZ)\,dS.$$

But the integrand on the right-hand side is equal to $F\cos\theta$, where $\theta$ is the angle
between the direction of $\mathbf{F}$ and the normal at the element $dS$. Hence,

$$\int_V \nabla\cdot\mathbf{F}\,d\tau = \int_S \mathbf{F}\cdot\mathbf{n}\,dS = \int_S \mathbf{F}\cdot d\mathbf{S}. \tag{32a}$$

If $\mathbf{F}$ is the derivative of a scalar point function $V$, which together with its
derivatives is finite and continuous over the configuration space, then

$$X = \frac{\partial V}{\partial x}, \quad Y = \frac{\partial V}{\partial y}, \quad Z = \frac{\partial V}{\partial z},$$

and

$$F\cos\theta = \frac{\partial V}{\partial n}.$$

Hence, equation (32a) may be written in the form

$$\int_V \nabla\cdot\mathbf{F}\,d\tau = \int_V (\operatorname{div}\mathbf{F})\,d\tau = \int_S \frac{\partial V}{\partial n}\,dS, \tag{32b}$$

or as

$$\int_V \nabla^2 V\,d\tau = \int_S \frac{\partial V}{\partial n}\,dS = \int_S (\operatorname{grad} V)\cdot d\mathbf{S}. \tag{32c}$$

Gauss' theorem (to be distinguished from Gauss' law, which is discussed in a
subsequent section) states a relation between the integral throughout a region $V$
of the configuration space and the integral taken over a surface $S$ enclosing the given
region. It is one of the most fundamental theorems in theoretical physics.
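
The equality of the volume integral of the divergence and the outward flux through the bounding surface can be verified numerically for a simple case. The sketch below is illustrative only: it assumes the region is the unit cube and picks an arbitrary field `F` with analytic divergence `div_F`, neither of which appears in the text.

```python
def F(x, y, z):
    # Assumed example field with components (X, Y, Z).
    return (x * x, y * z, z)

def div_F(x, y, z):
    # Analytic divergence of the field above: dX/dx + dY/dy + dZ/dz.
    return 2 * x + z + 1.0

def volume_integral(n=20):
    """Midpoint-rule integral of div F over the unit cube."""
    h = 1.0 / n
    mids = [(i + 0.5) * h for i in range(n)]
    return sum(div_F(x, y, z) for x in mids for y in mids for z in mids) * h ** 3

def surface_integral(n=20):
    """Midpoint-rule outward flux of F through the six faces of the unit cube."""
    h = 1.0 / n
    mids = [(i + 0.5) * h for i in range(n)]
    total = 0.0
    for u in mids:
        for v in mids:
            # Outward normals are +x at x = 1 and -x at x = 0, and so on.
            total += (F(1.0, u, v)[0] - F(0.0, u, v)[0]) * h * h
            total += (F(u, 1.0, v)[1] - F(u, 0.0, v)[1]) * h * h
            total += (F(u, v, 1.0)[2] - F(u, v, 0.0)[2]) * h * h
    return total

lhs = volume_integral()
rhs = surface_integral()
print(lhs, rhs)   # both sides should agree
```

For this particular field both integrals can also be evaluated by hand, and each equals 5/2.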

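8. Cross Product of Vectors. An important application of such a product is the
theorem designated as Stokes' Theorem, 2 which states a relation between a surface
and line integral (see section 2) in the form

$$\int_S (\nabla\times\mathbf{F})\cdot d\mathbf{S} = \int_C \mathbf{F}\cdot d\mathbf{r}, \tag{33}$$

where the second integral is taken along the entire curve $C$, enclosing the surface $S$.
The operator $\nabla\times\mathbf{F}$ is known as the curl of the vector function $\mathbf{F}$.
From the rules for forming the cross product of two vectors it follows that

$$\nabla\times\mathbf{F} = \mathbf{i}\left(\frac{\partial F_z}{\partial y} - \frac{\partial F_y}{\partial z}\right) + \mathbf{j}\left(\frac{\partial F_x}{\partial z} - \frac{\partial F_z}{\partial x}\right) + \mathbf{k}\left(\frac{\partial F_y}{\partial x} - \frac{\partial F_x}{\partial y}\right) = \operatorname{curl}\mathbf{F}. \tag{34}$$

The curl of $\mathbf{F}$ thus has the components defined by the bracket terms in the last
equation. This operator is of special significance in electromagnetic theory, but is
also used extensively in other fields of physics.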

Another illustration of the cross product of vectors is obtained from mechanics. The
moment of a force $\mathbf{F}$ about a point $O$ (see Fig. 6) is defined by the magnitude
$F\,r\sin\theta$. This corresponds to a vector

$$\mathbf{M} = \mathbf{r}\times\mathbf{F}, \tag{35}$$

of which the direction, as follows from Fig. 2, would be perpendicular to the paper and
pointing directly up.

FIG. 6. (Force $\mathbf{F}$ acting at $\mathbf{r}$, with lever arm $r\sin\theta$.)

Resolving the vectors $\mathbf{r}$ and $\mathbf{F}$ into their rectangular components,

$$\mathbf{M} = \mathbf{r}\times\mathbf{F} = (\mathbf{i}x + \mathbf{j}y + \mathbf{k}z)\times(\mathbf{i}F_x + \mathbf{j}F_y + \mathbf{k}F_z).$$

Applying the relations for cross products of unit vectors given in (11), we obtain
the relation

$$\mathbf{M} = \mathbf{i}M_x + \mathbf{j}M_y + \mathbf{k}M_z,$$

where

$$M_x = yF_z - zF_y, \qquad M_y = zF_x - xF_z, \qquad M_z = xF_y - yF_x. \tag{36}$$
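
The component relations for the moment translate directly into a few lines of code. The following sketch is illustrative; the numerical values of `r` and `F` are assumed, and `cross` and `dot` are helper names introduced only for this example.

```python
def cross(a, b):
    """Cross product a x b from the rectangular components."""
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by,
            az * bx - ax * bz,
            ax * by - ay * bx)

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

r = (2.0, 0.0, 0.0)    # assumed lever arm along the x-axis
F = (0.0, 3.0, 0.0)    # assumed force along the y-axis
M = cross(r, F)        # moment of F about the origin
print(M, dot(M, r), dot(M, F))
```

With $\mathbf{r}$ along x and $\mathbf{F}$ along y, the moment points along z, perpendicular to both, and its magnitude is the product of the force and the lever arm.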

2 For proof of this theorem see L. Page, op. cit., p. 31.

9. Equation for Flow of Heat. In section 4 it was pointed out that the heat flux
through a body is given by the relation

$$\mathbf{q} = -k\,\nabla T.$$

If there are no sources of heat within the surface, the heat flux through the surface
must be equal to the rate at which heat leaves the body. Hence

$$-\int_V \frac{\partial T}{\partial t}\,c\rho\,d\tau = \int_S -k\,\nabla T\cdot d\mathbf{S}, \tag{37}$$

where $c$ = heat capacity per unit mass,

$\rho$ = density.

The right-hand side of (37) can be written in the form

$$-\int_S k\,\nabla T\cdot d\mathbf{S} = \int_S \mathbf{n}\cdot\mathbf{q}\,dS, \tag{38}$$

where $\mathbf{n}$ is the unit vector normal to the surface at the element $dS$.
But in consequence of Gauss' theorem, equation (32a),

$$\int_S \mathbf{n}\cdot\mathbf{q}\,dS = \int_V \nabla\cdot\mathbf{q}\,d\tau = -\int_V \nabla\cdot(k\,\nabla T)\,d\tau.$$

Hence we derive from equation (37) the partial differential equation

$$\frac{\partial T}{\partial t} = \frac{1}{c\rho}\,\nabla\cdot(k\,\nabla T). \tag{39}$$

If $k$ is assumed constant, this equation becomes

$$\frac{\partial T}{\partial t} = \frac{k}{c\rho}\,\nabla^2 T. \tag{40}$$

While equation (39) is the general differential equation for heat flow, equation (40)
is the form usually adopted. The solutions of this equation give $T$ as a function of
both $t$ and the space coordinates.

Equation (40) can be applied to other types of "flow" as well as heat flow. Instead
of a gradient of $T$, we can have, in a solution, a gradient of $C$, the concentration
of solute. The resulting form of equation (40) is

$$\frac{\partial C}{\partial t} = D\,\nabla^2 C, \tag{41}$$

where $D$ is the coefficient of diffusion. Equation (41) is known as Fick's equation.
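
To indicate how such an equation yields the concentration (or temperature) as a function of time and position, the sketch below integrates the one-dimensional form $\partial C/\partial t = D\,\partial^2 C/\partial x^2$ by an explicit finite-difference scheme. This numerical method is not part of the text; the grid, the initial profile $\sin \pi x$, the fixed zero values at the ends, and all names are assumptions made for illustration.

```python
import math

def diffuse(profile, D, dx, dt, steps):
    """Advance C(x) in time with the explicit scheme; end values are held fixed."""
    r = D * dt / dx ** 2
    if r > 0.5:
        raise ValueError("explicit scheme needs D*dt/dx**2 <= 0.5 for stability")
    c = list(profile)
    for _ in range(steps):
        new = c[:]
        for i in range(1, len(c) - 1):
            # Discrete form of dC/dt = D * d2C/dx2 at interior point i.
            new[i] = c[i] + r * (c[i + 1] - 2 * c[i] + c[i - 1])
        c = new
    return c

n = 50
dx = 1.0 / n
D = 1.0
dt = 0.4 * dx ** 2
steps = 500
start = [math.sin(math.pi * i * dx) for i in range(n + 1)]
end = diffuse(start, D, dx, dt, steps)

# For this initial profile the exact solution decays as exp(-D * pi**2 * t).
exact_mid = math.exp(-D * math.pi ** 2 * steps * dt)
print(end[n // 2], exact_mid)
```

The computed value at the midpoint follows the exponential decay of the exact solution closely, which illustrates that the solution depends on both $t$ and the space coordinate.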

10. Equation for Wave Motion. The differential equation for the propagation
of a wave motion may be deduced directly, as in section 2.4, or by application of
vector analysis. In terms of rectangular coordinates the equation has the form

$$\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial y^2} + \frac{\partial^2\phi}{\partial z^2} = \frac{1}{v^2}\frac{\partial^2\phi}{\partial t^2},$$

where $\phi$ is the amplitude and $v$ is a constant which is known as the phase velocity.

In order to integrate this equation we use the method of separation of the variables
and postulate, as in section 2.4, that we can represent $\phi(t, x, y, z)$ as a product of
four functions in the form

$$\phi(t, x, y, z) = \Theta(t)\,X(x)\,Y(y)\,Z(z).$$

If this is a solution, then we must obtain the relation

$$\frac{1}{v^2\,\Theta}\frac{d^2\Theta}{dt^2} = \frac{1}{X}\frac{d^2X}{dx^2} + \frac{1}{Y}\frac{d^2Y}{dy^2} + \frac{1}{Z}\frac{d^2Z}{dz^2} = -m^2,$$

where $m^2$ is a constant.
This leads to the solutions

$$\Theta(t) = A\,e^{imvt} + \ldots$$

$$X(x) = B\,e^{imx} + \ldots$$

$$Y(y) = \ldots$$

$$Z(z) = \ldots$$

Evidently, $mv$ has the dimensions of a frequency $\nu$, and therefore $m$ must have the
dimensions of the reciprocal of a length $L$ which is fixed by the boundary conditions of the
problem. That is, $m = n\pi/L$, where $n$ will be an even or odd integer, and $2L/n = \lambda$,
a wave length which has three components, one for each of the axes of coordinates.

11. Gauss' Law. Let us consider a surface $S$ drawn about the point $P$ (see
Fig. 7) at which is located a positive charge $e$. With $P$ as vertex draw a cone of solid
angle $d\omega$ and let $dS_1$ and $dS_2$ designate the intercepting surface elements. Let
$\theta_1$ and $\theta_2$ designate the angles which the axis of the cone makes with the normals to
the surface elements.

FIG. 7.

Then,

$$dS_1\cos\theta_1 = r_1^2\,d\omega,$$

$$dS_2\cos\theta_2 = r_2^2\,d\omega,$$

so that

$$\frac{dS_1\cos\theta_1}{r_1^2} = \frac{dS_2\cos\theta_2}{r_2^2} = d\omega.$$

But the force $F$ on a unit charge, directed outwards along the normal, is given by
$(e/r_1^2)\cos\theta_1$ at $dS_1$ and by $(e/r_2^2)\cos\theta_2$ at $dS_2$. Hence

$$F_2\,dS_2 + F_1\,dS_1 = \frac{e}{r_1^2}\cos\theta_1\,dS_1 + \frac{e}{r_2^2}\cos\theta_2\,dS_2 = 2e\,d\omega.$$

If now we continue to divide up the surface into elements by an infinite number of
cones, and add up the contributions to the total flux through the surface, we shall
obtain the relation

$$\int_S F\cos\theta\,dS = \int_S \mathbf{F}\cdot\mathbf{n}\,dS = \int_S \mathbf{F}\cdot d\mathbf{S} = 2e\int_0^{2\pi} d\omega = 4\pi e. \tag{44a}$$

This result is evidently valid if instead of one charge located at $P$ we have a
number of charges distributed inside the surface $S$. In that case, if $\rho = \rho(x, y, z)$
designates the distribution function for the positive charge in each element of
volume, equation (44a) becomes

$$\int_S \mathbf{F}\cdot d\mathbf{S} = 4\pi\int_V \rho\,d\tau. \tag{44b}$$

It is evident that, if the surface $S$ is taken outside the point $P'$ at which the charge
is located (see Fig. 7), then the flux through the element $dS_2$ (located at $A'$) is
equal and opposite to that through the element $dS_1$. Hence equations
(44a) and (44b) assume the form

$$\int_S \mathbf{F}\cdot d\mathbf{S} = 0. \tag{45}$$

Equations (44) and (45) are statements of Gauss' Law.
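
Both statements of the law can be checked by direct summation of the flux over a spherical surface. The sketch below is illustrative only: it assumes a unit charge, a sphere of radius 2 about the origin, and an inverse-square force $e/r^2$ on a unit charge; the function name and the two charge positions are hypothetical.

```python
import math

def flux_of_point_charge(charge, position, radius=2.0, n_theta=200, n_phi=200):
    """Sum F . n dS over a sphere about the origin for a charge at `position`."""
    d_theta = math.pi / n_theta
    d_phi = 2 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            # Outward unit normal and the corresponding point on the sphere.
            n = (math.sin(theta) * math.cos(phi),
                 math.sin(theta) * math.sin(phi),
                 math.cos(theta))
            point = [radius * c for c in n]
            sep = [p - q for p, q in zip(point, position)]
            dist = math.sqrt(sum(s * s for s in sep))
            # Inverse-square force on a unit charge, projected on the normal.
            f_dot_n = charge * sum(s * c for s, c in zip(sep, n)) / dist ** 3
            total += f_dot_n * radius ** 2 * math.sin(theta) * d_theta * d_phi
    return total

flux_inside = flux_of_point_charge(1.0, (0.5, 0.0, 0.0))    # charge enclosed
flux_outside = flux_of_point_charge(1.0, (3.0, 0.0, 0.0))   # charge not enclosed
print(flux_inside, 4 * math.pi, flux_outside)
```

The enclosed charge gives a flux close to $4\pi e$ even though it is not at the center of the sphere, while the charge outside the surface gives a flux close to zero.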

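12. Equations of Poisson and Laplace. Let us now combine equations (44b)
and (45) with Gauss' theorem which was derived in section 7 in the form

$$\int_V \nabla\cdot\mathbf{F}\,d\tau = \int_S \mathbf{F}\cdot d\mathbf{S}. \tag{32}$$

We thus obtain the relation

$$\int_V \nabla\cdot\mathbf{F}\,d\tau = 4\pi\int_V \rho\,d\tau,$$

which, in consequence of (27), becomes

$$\int_V \left(\frac{\partial F_x}{\partial x} + \frac{\partial F_y}{\partial y} + \frac{\partial F_z}{\partial z}\right) d\tau = 4\pi\int_V \rho\,d\tau.$$

That is,

$$\frac{\partial F_x}{\partial x} + \frac{\partial F_y}{\partial y} + \frac{\partial F_z}{\partial z} = 4\pi\rho. \tag{46}$$

Now, if the force components are derived from a potential energy function
$V = V(x, y, z)$, then

$$F_x = -\frac{\partial V}{\partial x}, \quad F_y = -\frac{\partial V}{\partial y}, \quad F_z = -\frac{\partial V}{\partial z},$$

and therefore equation (46) assumes the form

$$\nabla^2 V = -4\pi\rho. \tag{47}$$

Equation (47) is known as Poisson's equation.

If $\rho = 0$, that is, if the potential energy function is taken over a region in which
there is no electric charge, equation (47) assumes the form

$$\nabla^2 V = 0,$$

which is known as Laplace's equation.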
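
As an illustration, the Laplacian of a potential can be estimated by second differences. In the sketch below, which is not part of the text, the inverse-distance potential of a point charge is tested at a point free of charge, and an assumed quadratic potential is included for contrast; the function names and the test point are hypothetical.

```python
import math

def laplacian(V, point, h=1e-3):
    """Second-difference estimate of d2V/dx2 + d2V/dy2 + d2V/dz2 at point."""
    center = V(*point)
    total = 0.0
    for axis in range(3):
        forward = list(point)
        backward = list(point)
        forward[axis] += h
        backward[axis] -= h
        total += (V(*forward) - 2 * center + V(*backward)) / h ** 2
    return total

def inverse_distance(x, y, z):
    # Potential of a unit charge at the origin.
    return 1.0 / math.sqrt(x * x + y * y + z * z)

def quadratic(x, y, z):
    # Assumed contrasting potential whose Laplacian is 6 everywhere.
    return x * x + y * y + z * z

p = (1.0, 1.5, -0.5)
lap_point_charge = laplacian(inverse_distance, p)
lap_quadratic = laplacian(quadratic, p)
print(lap_point_charge, lap_quadratic)
```

The first result is close to zero, in agreement with Laplace's equation for a charge-free point, while the second is not, as would be the case in a region where $\rho$ differs from zero.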

13. Transformation of Coordinates. In many cases it is more convenient to use
a coordinate system different from that involving Cartesian coordinates. By application
of Gauss' theorem it is possible to derive the form of the operator $\nabla^2$ for
such a system.

Let

$$q_1 = q_1(x, y, z), \quad q_2 = q_2(x, y, z), \quad q_3 = q_3(x, y, z)$$

denote the generalized curvilinear coordinates as functions of the Cartesian coordinates.
In the following considerations it will be assumed that the q-coordinates are
of such a nature that three surfaces $q_1$ = constant, $q_2$ = constant, and $q_3$ = constant,
intersect orthogonally. Such a system is of the type designated "orthogonal curvilinear."

"In Cartesian coordinates the three families of surfaces are families of planes
parallel to the different coordinate planes. In polars we have a family of concentric
spheres, a family of cones with the same axis and vertex, and a family of
planes intersecting in the one straight line. In cylindricals we have a family of coaxial
cylinders, a family of planes intersecting in the axis of the cylinders, and a
family of planes at right angles to the axis of the cylinders. In elliptic coordinates
we have three families of confocal conicoids, ellipsoids, hyperboloids of one sheet
and hyperboloids of two sheets." 3

Let us consider the element of volume bounded by the surfaces

$$q_1 \pm \frac{dq_1}{2}, \quad q_2 \pm \frac{dq_2}{2}, \quad q_3 \pm \frac{dq_3}{2}.$$

This will be approximately rectangular, and the length of the side between the
surfaces $q_1 - dq_1/2$ and $q_1 + dq_1/2$ is $a_1\,dq_1$, where $a_1$ is a function of $q_1$, $q_2$, and $q_3$.
Similarly, let $a_2\,dq_2$ and $a_3\,dq_3$ denote the lengths of the other sides, where $a_2$ and $a_3$
are each functions of the three generalized coordinates.

Let $Q_1$, $Q_2$, and $Q_3$ designate the components of force normal to the surfaces
$q_1$, $q_2$, and $q_3$ respectively.

Then the area of the surface at $q_1$ is $a_2 a_3\,dq_2\,dq_3$, and the flux of force through this
area is $Q_1 a_2 a_3\,dq_2\,dq_3$. The difference between the flux at $q_1 + dq_1/2$ and that at
$q_1 - dq_1/2$ is given by

$$\frac{\partial}{\partial q_1}(Q_1 a_2 a_3)\,dq_1\,dq_2\,dq_3.$$

Adding up the net amounts of flux through the other two pairs of surfaces, and
applying Gauss' law, the result obtained is

$$\left[\frac{\partial}{\partial q_1}(Q_1 a_2 a_3) + \frac{\partial}{\partial q_2}(Q_2 a_1 a_3) + \frac{\partial}{\partial q_3}(Q_3 a_1 a_2)\right] dq_1\,dq_2\,dq_3 = 4\pi\rho\,a_1 a_2 a_3\,dq_1\,dq_2\,dq_3.$$

That is,

$$\frac{1}{a_1 a_2 a_3}\left[\frac{\partial}{\partial q_1}(Q_1 a_2 a_3) + \frac{\partial}{\partial q_2}(Q_2 a_1 a_3) + \frac{\partial}{\partial q_3}(Q_3 a_1 a_2)\right] - 4\pi\rho = 0, \tag{49}$$

which is a more generalized form of equation (46).
Let $V = V(q_1, q_2, q_3)$ denote the potential function. Then

$$Q_1 = -\frac{1}{a_1}\frac{\partial V}{\partial q_1}, \quad Q_2 = -\frac{1}{a_2}\frac{\partial V}{\partial q_2}, \quad Q_3 = -\frac{1}{a_3}\frac{\partial V}{\partial q_3},$$

and equation (49) becomes

$$\frac{\partial}{\partial q_1}\left(\frac{a_2 a_3}{a_1}\frac{\partial V}{\partial q_1}\right) + \frac{\partial}{\partial q_2}\left(\frac{a_1 a_3}{a_2}\frac{\partial V}{\partial q_2}\right) + \frac{\partial}{\partial q_3}\left(\frac{a_1 a_2}{a_3}\frac{\partial V}{\partial q_3}\right) + 4\pi\rho\,a_1 a_2 a_3 = 0. \tag{50}$$

Comparing the last equation with equation (47), it follows that

$$\nabla^2 = \frac{1}{a_1 a_2 a_3}\left[\frac{\partial}{\partial q_1}\left(\frac{a_2 a_3}{a_1}\frac{\partial}{\partial q_1}\right) + \frac{\partial}{\partial q_2}\left(\frac{a_1 a_3}{a_2}\frac{\partial}{\partial q_2}\right) + \frac{\partial}{\partial q_3}\left(\frac{a_1 a_2}{a_3}\frac{\partial}{\partial q_3}\right)\right]. \tag{51}$$

The element of volume is evidently

$$dx\,dy\,dz = a_1 a_2 a_3\,dq_1\,dq_2\,dq_3, \tag{52}$$

and the element of distance $ds$ is given by the relation

$$(ds)^2 = a_1^2 (dq_1)^2 + a_2^2 (dq_2)^2 + a_3^2 (dq_3)^2. \tag{53}$$

The product $a_1 a_2 a_3$ is known as the discriminant of the transformation. 4
Illustrations of transformations to cylindrical, spherical polar, and confocal elliptic
coordinates have been given in different sections in the preceding chapters.

3 R. A. Houstoun, "An Introduction to Mathematical Physics," p. 21, on whose
remarks this section is based.
14. Green's Theorem. Equations (31) and (44b) lead to the relation

(54)

In this equation X, Y, and Z may be written as the force components derived from 
a potential energy function V, such that 



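Let

$$X = \phi\frac{\partial\psi}{\partial x}, \quad Y = \phi\frac{\partial\psi}{\partial y}, \quad Z = \phi\frac{\partial\psi}{\partial z},$$

where $\phi$ and $\psi$ are scalar point-functions which with their derivatives are uniform
and continuous in the configuration space. From the equations in section 7 it
follows that

$$\int_V \left(\phi\,\nabla^2\psi + \frac{\partial\phi}{\partial x}\frac{\partial\psi}{\partial x} + \frac{\partial\phi}{\partial y}\frac{\partial\psi}{\partial y} + \frac{\partial\phi}{\partial z}\frac{\partial\psi}{\partial z}\right) d\tau = \int_S \phi\left(l\frac{\partial\psi}{\partial x} + m\frac{\partial\psi}{\partial y} + n\frac{\partial\psi}{\partial z}\right) dS.$$

Also

$$l\frac{\partial\psi}{\partial x} + m\frac{\partial\psi}{\partial y} + n\frac{\partial\psi}{\partial z} = \frac{\partial\psi}{\partial n}.$$

Hence

$$\int_V (\phi\,\nabla^2\psi + \nabla\phi\cdot\nabla\psi)\,d\tau = \int_S \phi\frac{\partial\psi}{\partial n}\,dS. \tag{55}$$

This is one form of Green's theorem. A second form, which is more generally used,
is derived from (55) as follows: If, in equation (55), $\phi$ and $\psi$ are interchanged, the
result is

$$\int_V (\psi\,\nabla^2\phi + \nabla\psi\cdot\nabla\phi)\,d\tau = \int_S \psi\frac{\partial\phi}{\partial n}\,dS. \tag{56}$$

Now let us subtract (56) from (55). The result is the second form of Green's
theorem

$$\int_V (\phi\,\nabla^2\psi - \psi\,\nabla^2\phi)\,d\tau = \int_S \left(\phi\frac{\partial\psi}{\partial n} - \psi\frac{\partial\phi}{\partial n}\right) dS. \tag{57}$$

Because of equation (21) this may be written in the form

$$\int_V (\phi\,\nabla^2\psi - \psi\,\nabla^2\phi)\,d\tau = \int_S (\phi\,\operatorname{grad}\psi - \psi\,\operatorname{grad}\phi)\cdot d\mathbf{S}, \tag{58}$$

which is extremely useful in quantum mechanics.

4 It should be observed that the magnitudes designated by $a_1$, $a_2$, and $a_3$ correspond
to the squares of the magnitudes designated by the same letters in equation
(4.22) and in section 6.2.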

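15. Proof that Solutions of S. Equation Form an Orthogonal System of Functions.
In the S. equation

$$-\frac{h^2}{8\pi^2\mu}\nabla^2\phi + V\phi = E\phi,$$

let $\phi_n$ and $\bar{\phi}_m$ designate two eigenfunctions corresponding to the eigenvalues $E_n$ and
$E_m$ respectively. Then

$$-\frac{h^2}{8\pi^2\mu}\nabla^2\phi_n + V\phi_n = E_n\phi_n, \tag{i}$$

and

$$-\frac{h^2}{8\pi^2\mu}\nabla^2\bar{\phi}_m + V\bar{\phi}_m = E_m\bar{\phi}_m. \tag{ii}$$

Multiplying both sides of equation (i) by $\bar{\phi}_m$ and integrating over the configuration
space, the result is

$$-\frac{h^2}{8\pi^2\mu}\int \bar{\phi}_m\nabla^2\phi_n\,d\tau + \int V\bar{\phi}_m\phi_n\,d\tau = E_n\int \bar{\phi}_m\phi_n\,d\tau. \tag{iii}$$

Repeating the operation with $\phi_n$ on equation (ii) the result is

$$-\frac{h^2}{8\pi^2\mu}\int \phi_n\nabla^2\bar{\phi}_m\,d\tau + \int V\phi_n\bar{\phi}_m\,d\tau = E_m\int \phi_n\bar{\phi}_m\,d\tau. \tag{iv}$$

Subtracting (iv) from (iii) we obtain the relation

$$-\frac{h^2}{8\pi^2\mu}\int (\bar{\phi}_m\nabla^2\phi_n - \phi_n\nabla^2\bar{\phi}_m)\,d\tau = (E_n - E_m)\int \bar{\phi}_m\phi_n\,d\tau. \tag{v}$$

From equation (57) it follows that the left-hand side of (v) is equal to

$$-\frac{h^2}{8\pi^2\mu}\int_S \left(\bar{\phi}_m\frac{\partial\phi_n}{\partial n} - \phi_n\frac{\partial\bar{\phi}_m}{\partial n}\right) dS. \tag{vi}$$

Now it has been postulated that the solutions of the S. equation must represent
functions which vanish at the limits of the configuration space. For instance,
the functions $\phi_n$, $\phi_m$, etc., must vanish for $r = \infty$, or for $x = y = z = \infty$. If
we take the bounding surface $S$ at these limits, then the integral in (vi) vanishes.
Consequently,

$$(E_n - E_m)\int \bar{\phi}_m\phi_n\,d\tau = 0,$$

and if the state corresponding to $E_n$ is not identical with that corresponding to $E_m$,
then $E_n - E_m \neq 0$. Hence

$$\int \bar{\phi}_m\phi_n\,d\tau = 0 \quad \text{for } n \neq m,$$

$$\int \bar{\phi}_m\phi_n\,d\tau = N^2 \quad \text{for } n = m,$$

where $1/N$ is the normalizing factor.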
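
The orthogonality relation may be illustrated with a simple set of eigenfunctions. The sketch below is not taken from this section: it assumes the functions $\sin(n\pi x/L)$, which vanish at both ends of an interval of length $L$, and evaluates the overlap integrals by the midpoint rule. The name `overlap` is hypothetical.

```python
import math

def overlap(n, m, length=1.0, samples=2000):
    """Midpoint-rule integral of sin(n*pi*x/L) * sin(m*pi*x/L) over 0..L."""
    dx = length / samples
    total = 0.0
    for i in range(samples):
        x = (i + 0.5) * dx
        total += (math.sin(n * math.pi * x / length)
                  * math.sin(m * math.pi * x / length) * dx)
    return total

different_states = overlap(2, 3)    # n != m
same_state = overlap(2, 2)          # n == m, equal to N**2 = L/2
normalizing_factor = 1.0 / math.sqrt(same_state)
print(different_states, same_state, normalizing_factor)
```

For these functions $N^2 = L/2$, so that with $L = 1$ the normalizing factor $1/N$ is $\sqrt{2}$.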

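16. Solution of Laplace's Equation in Terms of Zonal Harmonics. 5 In Chapter
VI the solution of Legendre's equation (6.21)

$$(1 - x^2)\frac{d^2X}{dx^2} - 2x\frac{dX}{dx} + \alpha^2 X = 0 \tag{59}$$

was obtained by assuming that $X$ could be expressed as a series of terms of the form
$a_n x^n$ and then determining the coefficients from the condition

$$\alpha^2 = m(m + 1),$$

where $m$ = 0, 1, 2, etc.

It is possible, however, to obtain a solution of equation (59) by an entirely different
method which is of special interest in connection with the theory of potentials.

The potential energy due to a unit of mass (or unit of charge) concentrated at a
given point $x_1, y_1, z_1$ is

$$V = \frac{1}{r} = \frac{1}{\sqrt{(x - x_1)^2 + (y - y_1)^2 + (z - z_1)^2}}. \tag{i}$$

That this is a solution of Laplace's equation

$$\nabla^2 V = 0$$

is readily verified. By differentiation of (i) we obtain

$$\frac{\partial V}{\partial x} = -\frac{(x - x_1)}{\{(x - x_1)^2 + (y - y_1)^2 + (z - z_1)^2\}^{3/2}}$$

and

$$\frac{\partial^2 V}{\partial x^2} = \frac{2(x - x_1)^2 - (y - y_1)^2 - (z - z_1)^2}{\{(x - x_1)^2 + (y - y_1)^2 + (z - z_1)^2\}^{5/2}}.$$

From similar relations for the derivatives with respect to $y$ and $z$, we obtain the
result

$$\nabla^2 V = \frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2} + \frac{\partial^2 V}{\partial z^2} = 0.$$

5 Based on discussion by Byerly, Chapter V.

In terms of spherical coordinates $r$, $\theta$, and $\eta$, (i) becomes

$$V = \left[r^2 - 2rr_1\{\cos\theta\cos\theta_1 + \sin\theta\sin\theta_1\cos(\eta - \eta_1)\} + r_1^2\right]^{-1/2}, \tag{ii}$$

which is a solution of Laplace's equation in spherical coordinates.
Let us consider the case in which $\theta_1 = 0$, and $V$ is independent of $\eta$. Then

$$V = (r^2 - 2rr_1\cos\theta + r_1^2)^{-1/2} \tag{iii}$$

is a solution of Laplace's equation

$$\frac{\partial}{\partial r}\left(r^2\frac{\partial V}{\partial r}\right) + \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial V}{\partial\theta}\right) = 0. \tag{iv}$$

Equation (iii) may be written in the form

$$V = \frac{1}{r}\left(1 - 2\frac{r_1}{r}\cos\theta + \frac{r_1^2}{r^2}\right)^{-1/2}, \tag{v}$$

or

$$V = \frac{1}{r_1}\left(1 - 2\frac{r}{r_1}\cos\theta + \frac{r^2}{r_1^2}\right)^{-1/2}. \tag{vi}$$

For $r_1/r < 1$, the expression in parentheses in (v) may be expanded into a convergent
series, so that

$$\left(1 - 2\frac{r_1}{r}\cos\theta + \frac{r_1^2}{r^2}\right)^{-1/2} = \sum_m p_m\left(\frac{r_1}{r}\right)^m,$$

where $p_m$ is a function of $\cos\theta$.
Hence

$$V = \frac{1}{r}\sum_m p_m\left(\frac{r_1}{r}\right)^m, \tag{vii}$$

$$\frac{\partial V}{\partial r} = -\sum_m p_m\frac{r_1^m}{r^{m+2}}(m + 1),$$

or

$$r^2\frac{\partial V}{\partial r} = -\sum_m p_m\left(\frac{r_1}{r}\right)^m (m + 1),$$

and

$$\frac{\partial}{\partial r}\left(r^2\frac{\partial V}{\partial r}\right) = \sum_m m(m + 1)\,p_m\frac{r_1^m}{r^{m+1}}.$$

Consequently,

$$\sum_m \frac{r_1^m}{r^{m+1}}\left[m(m + 1)\,p_m + \frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{dp_m}{d\theta}\right)\right] = 0.$$

Since this must be valid for all values of $r > r_1$, the coefficient of each power of $r$
must vanish identically, and therefore

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{dp_m}{d\theta}\right) + m(m + 1)\,p_m = 0. \tag{viii}$$

Now if we set $x = \cos\theta$, then

$$\frac{d}{d\theta} = \frac{dx}{d\theta}\frac{d}{dx} = -\sin\theta\frac{d}{dx},$$

and equation (viii) assumes the form of Legendre's equation,

$$(1 - x^2)\frac{d^2p_m}{dx^2} - 2x\frac{dp_m}{dx} + m(m + 1)\,p_m = 0. \tag{60}$$

Hence, $p_m$ in equation (vii) is a solution of this equation.
In a similar manner it can be shown that

$$V = \frac{1}{r_1}\sum_m p_m\left(\frac{r}{r_1}\right)^m$$

is a solution for the case $r/r_1 < 1$.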

It now remains to determine the exact form of $p_m$. Let us consider the case
$r > r_1$, and the expansion of the expression in equation (v). Let $\cos\theta = x$, and
$r_1/r = z$. Then

$$(1 - 2xz + z^2)^{-1/2} = \sum_m p_m z^m.$$

The coefficient of $z^m$ is $p_m$, and the values of this coefficient are readily obtained.
For the low values of $m$ they are as follows:

$$p_0 = 1; \quad p_1 = x; \quad p_2 = \frac{3}{2}\left(x^2 - \frac{1}{3}\right); \quad p_3 = \frac{5\cdot 3\cdot 1}{3\cdot 2}\left(x^3 - \frac{3}{5}x\right);$$

and for $z^m$, the coefficient is

$$p_m = \frac{(2m - 1)(2m - 3)\cdots 1}{m!}\left[x^m - \frac{m(m - 1)}{2(2m - 1)}x^{m-2} + \frac{m(m - 1)(m - 2)(m - 3)}{2\cdot 4\,(2m - 1)(2m - 3)}x^{m-4} - \cdots\right].$$

But this is the Legendre polynomial $P_m(x)$. Hence for $r_1/r < 1$,

$$V = \frac{1}{r}\left[P_0(\cos\theta) + \frac{r_1}{r}P_1(\cos\theta) + \frac{r_1^2}{r^2}P_2(\cos\theta) + \cdots\right],$$

in which each $P_m(\cos\theta)$ is a solution of Legendre's equation (60), where $m$ is a positive integer.
Also

$$V = r^m P_m(\cos\theta)$$

and

$$V = \frac{P_m(\cos\theta)}{r^{m+1}}$$

are solutions of equation (iv).
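
The identification of the expansion coefficients with the Legendre polynomials can be checked numerically. The sketch below is illustrative only. Instead of the explicit series for $p_m$ it uses the standard three-term recurrence $(k + 1)P_{k+1}(x) = (2k + 1)\,x\,P_k(x) - k\,P_{k-1}(x)$, which is not given in this section, and compares the summed series with the closed form $(1 - 2xz + z^2)^{-1/2}$. The function names and the sample values of $x$ and $z$ are assumptions.

```python
import math

def legendre(m, x):
    """P_m(x) from the three-term recurrence, starting with P_0 = 1, P_1 = x."""
    previous, current = 1.0, x
    if m == 0:
        return previous
    for k in range(1, m):
        previous, current = current, ((2 * k + 1) * x * current - k * previous) / (k + 1)
    return current

def generating_function(x, z):
    return 1.0 / math.sqrt(1.0 - 2.0 * x * z + z * z)

def series(x, z, terms=40):
    """Partial sum of P_m(x) * z**m, valid for |z| < 1."""
    return sum(legendre(m, x) * z ** m for m in range(terms))

x, z = 0.3, 0.4            # x plays the role of cos(theta), z that of r1/r
closed_form = generating_function(x, z)
summed = series(x, z)
print(closed_form, summed, legendre(2, 0.5), legendre(3, 0.5))
```

The values of `legendre(2, 0.5)` and `legendre(3, 0.5)` agree with the low-order expressions for $p_2$ and $p_3$, namely $\tfrac{3}{2}(x^2 - \tfrac{1}{3})$ and $\tfrac{5}{2}(x^3 - \tfrac{3}{5}x)$ evaluated at $x = 0.5$.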

17. Potential at a Point Due to a Charge Distribution. Let us consider the potential
due to a homogeneous sphere. Let the radius of the sphere be $a$, and $R$ the
distance of the point $P$ from the center of the sphere (see Fig. 8).

FIG. 8.

$$V = \iiint \frac{\sigma\,h^2\sin\theta\,d\theta\,d\eta\,dh}{S},$$

where $\sigma = \sigma(r)$ is the charge distribution function and $S$ denotes the distance from
any point (inside or outside the sphere) to the point $P$.

Now

$$S^2 = h^2 + R^2 - 2hR\cos\theta.$$

Differentiating for a constant value of $h$,

$$S\,dS = hR\sin\theta\,d\theta.$$

Hence

$$V = \iiint \frac{\sigma h}{R}\,dS\,d\eta\,dh.$$

For $R > a$,

$$V = \frac{4\pi}{R}\int_0^a \sigma h^2\,dh, \tag{i}$$

since $\sigma$ is independent of $\theta$ and $\eta$. That is, for an external point the potential due
to a sphere is the same as if the entire charge were concentrated at the center.

For $R = a$,

$$V = \frac{4\pi}{a}\int_0^a \sigma h^2\,dh. \tag{ii}$$

Since $V$ is constant throughout a conductor the potential at a point inside the
sphere must be the same as that at the surface. That is, for a point inside a sphere
of given radius $a$, the potential is independent of the distance from the center and
given by equation (ii).

Let us now consider the potential at a point $r_1$ due to a charge distribution $\sigma = \sigma(r)$
which extends from $r = 0$ to $r = \infty$. It follows from equations (i) and (ii) that
for $r < r_1$,

$$V_{\mathrm{I}} = \frac{1}{r_1}\int_0^{r_1} \sigma(r)\,4\pi r^2\,dr, \tag{iii}$$

and for $r > r_1$,

$$V_{\mathrm{II}} = \int_{r_1}^{\infty} \frac{\sigma(r)}{r}\,4\pi r^2\,dr. \tag{iv}$$

Hence the potential energy due to the whole distribution is

$$V = \frac{4\pi}{r_1}\int_0^{r_1} \sigma(r)\,r^2\,dr + 4\pi\int_{r_1}^{\infty} \sigma(r)\,r\,dr. \tag{v}$$
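
The result for an external point can be confirmed by direct numerical integration. The sketch below is illustrative only: it assumes a homogeneous sphere of unit radius and unit charge density, sums $\sigma\,d\tau/S$ over the sphere with the distance $S$ taken from $S^2 = h^2 + R^2 - 2hR\cos\theta$, and compares the sum with the total charge divided by $R$. The function name and the numerical values are hypothetical.

```python
import math

def sphere_potential(R, a=1.0, sigma=1.0, n_h=200, n_theta=200):
    """Potential at distance R from the center of a homogeneous sphere of radius a."""
    dh = a / n_h
    d_theta = math.pi / n_theta
    total = 0.0
    for i in range(n_h):
        h = (i + 0.5) * dh                   # distance of the element from the center
        for j in range(n_theta):
            theta = (j + 0.5) * d_theta
            # Distance from the volume element to the point P.
            S = math.sqrt(h * h + R * R - 2.0 * h * R * math.cos(theta))
            # Ring-shaped element: the azimuthal integration contributes 2*pi.
            d_tau = 2.0 * math.pi * h * h * math.sin(theta) * dh * d_theta
            total += sigma * d_tau / S
    return total

R = 2.5
numeric = sphere_potential(R)
total_charge = 4.0 * math.pi / 3.0           # sigma * (4/3) * pi * a**3 with a = sigma = 1
point_charge_value = total_charge / R
print(numeric, point_charge_value)
```

The two numbers agree closely, in accordance with the statement that the sphere acts on an external point as if its entire charge were concentrated at the center.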
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